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Preface for the Instructor 


A course on ordinary differential equations is a standard part of the under- 
graduate mathematics curriculum throughout the world. Such a course often 
includes some introductory material on Fourier series and a few applications. 

But the world is changing. Because of the theory of wavelets, Fourier 
analysis is ever more important and central. And applications are a driving 
force behind all of mathematics. 

Thus it is appropriate to have a text that presents a more balanced pic- 
ture. The text should have differential equations (both ordinary and partial), 
Fourier analysis, and applications in equal measure and with equal weight. 
This is such a text. 

Certainly a sophomore-level course on differential equations can be taught 
from this book. Also an undergraduate-level course on Fourier analysis can be 
taught from this book. And the copious and substantial applications that we 
provide enrich both those points of view. 

While this is a substantive book, I should stress that the text does not 
assume that the student knows the Lebesgue integral. The Riemann integral 
is used throughout. And we also do not assume that the student knows any 
functional analysis. Both Lebesgue measure theory and functional analysis are 
graduate-level topics, and inappropriate in the present context. 

We likewise do not assume that the student has had a course in under- 
graduate real analysis—as from the books of Rudin or Krantz or the author’s. 
We intend this book for a broad audience of mathematics and engineering and 
physics students. 

To make the book timely and exciting, we include a substantial chapter on 
basic properties of wavelets, with applications to signal processing and image 
processing. This should give students and instructors alike a taste of what is 
happening in the subject today. 

Since this is a textbook, we present copious examples. There are a great 
many figures—just because the subject of analysis, properly viewed, is quite 
visual. And there are substantive exercise sets. The text also contains on-the- 
fly exercises which should cause the student to pick up his/her pencil and 
do some calculations. Each chapter ends with a special collection of exercises 
(called Problems for Review and Discovery) that ties together the ideas in 
the chapter. There is a collection of Drill Exercises, a collection of Challenge 
Problems (problems which require some thought), and a collection of Problems 
for Discussion and Exploration (problems that are suitable for group work). 
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xii PREFACE FOR THE INSTRUCTOR 


Of course there are solutions to selected exercises provided at the end of 
the book. We include also a Glossary and a Table of Notation. 

It is a pleasure to thank the book’s many insightful reviewers, who con- 
tributed numerous ideas and suggestions. Ken Rosen gave particularly cogent 
and detailed advice which I appreciate very much. I also thank my editor Bob 
Ross for his support and encouragement. 

It is hoped that this text will find an enthusiastic audience, both among 
students and instructors. We look forward to hearing from the readership as 
the book goes into use. 


Steven G. Krantz 
St. Louis, Missouri 


Preface for the Student 


The purpose of this book is to teach you the interrelationships among differ- 
ential equations, Fourier analysis, and wavelet theory. This is a lot of territory 
to cover, and the purpose of this brief Preface is to give you some guidance in 
the process. 

Differential equations are the language of science. Most of the laws of 
nature are formulated in the language of differential equations. If you want to 
be an engineer, or a physicist, or a mathematician, or even a biologist, then 
you should learn to speak differential equations. This book will set you on 
that road. 

Fourier analysis is one of our most powerful tools for analyzing functions. 
In Fourier analysis, we break a given function up into component parts— 
usually sines and cosines. And we can understand the structure of the function 
using this tool. A modern aspect of Fourier analysis is wavelet theory, which 
allows us to replace sines and cosines by more general core objects that are 
tailored to the problem at hand. Wavelet theory has revolutionized the prac- 
tice of image processing, signal processing, and many other parts of modern 
technology. 

Working with this book is a real intellectual adventure. It should help you 
to learn and to grow as a mathematical scientist. We wish you all the best in 
your journey. 


Steven G. Krantz 
St. Louis, Missouri 
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What Is a Differential Equation? 


e The concept of a differential equation 
e Characteristics of a solution 

e Finding a solution 

e Separable equations 

e First-order linear equations 

e Exact equations 


e Orthogonal trajectories 


a 


1.1 Introductory Remarks 


A differential equation is an equation relating some function f to one or more 
of its derivatives. An example is 
d? f 


dx? 


d, 
(0) +224 (a) + (0) =sing. (1.1.1) 

x 
Observe that this particular equation involves a function f together with 
its first and second derivatives. Any given differential equation may or may 
not involve f or any particular derivative of f. But, for an equation to be a 
differential equation, at least some derivative of f must appear. The objective 
in solving an equation like (1.1.1) is to find the function f. Thus we already 
perceive a fundamental new paradigm: When we solve an algebraic equation, 
we seek a number or perhaps a collection of numbers; but when we solve a 
differential equation we seek one or more functions. 

As a simple example, consider the differential equation 
yay. 

It is easy to determine that any function of the form y = Ce”® is a solution 
of this equation. For the derivative of the function is equal to itself. So we 
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see that the solution set of this particular differential equation is an infinite 
family of functions parametrized by a parameter C’. This phenomenon is quite 
typical of what we will see when we solve differential equations in the chapters 
that follow. 

Many of the laws of nature—in physics, in engineering, in chemistry, in 
biology, and in astronomy—find their most natural expression in the lan- 
guage of differential equations. Put in other words, differential equations are 
the language of nature. Applications of differential equations also abound in 
mathematics itself, especially in geometry and harmonic analysis and model- 
ing. Differential equations occur in economics and systems science and other 
fields of mathematical science. 

It is not difficult to perceive why differential equations arise so readily in 
the sciences. If y = f(a) is a given function, then the derivative df/dx can 
be interpreted as the rate of change of f with respect to x. In any process of 
nature, the variables involved are related to their rates of change by the basic 
scientific principles that govern the process—that is, by the laws of nature. 
When this relationship is expressed in mathematical notation, the result is 
usually a differential equation. 

Certainly Newton’s law of universal gravitation, Maxwell’s field equa- 
tions, the motions of the planets, and the refraction of light are important 
examples which can be expressed using differential equations. Much of our 
understanding of nature comes from our ability to solve differential equations. 
The purpose of this book is to introduce you to some of these techniques. 

The following example will illustrate some of these ideas. According to 
Newton’s second law of motion, the acceleration a of a body of mass m is 
proportional to the total force F acting on the body. The standard expression 
of this relationship is 

F=m.-a. (1.1.2) 


Suppose in particular that we are analyzing a falling body. Express the 
height of the body from the surface of the Earth as y(t) feet at time t. The 
only force acting on the body is that due to gravity. If g is the acceleration 
due to gravity (about —32 ft./sec.? near the surface of the Earth) then the 
force exerted on the body has magnitude m-g. And of course the acceleration 
is d?y/dt?. Thus Newton’s law (1.1.2) becomes 


d*y 
m-g=m-ay (1.1.3) 
or 
_&y 
9 We 


We may make the problem a little more interesting by supposing that 
air exerts a resisting force proportional to the velocity. If the constant of 
proportionality is k, then the total force acting on the body is mg —k-(dy/dt). 
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Then equation (1.1.3) becomes 
dy dy 
_— =m: — . 
dt dt? 
Equations (1.1.3) and (1.1.4) express the essential attributes of this physical 
system. 
A few additional examples of differential equations are these: 


(1.1.4) 


a d 
(1-2?) 9 — 22 + p(p+1)y =0; (1.1.5) 
d? d 
PLY 4 A 4 (2 — py =0; (1.1.6) 
d2 
(1—a)y" — ay’ + p?y = 0; 
a =k-y; (1.1.10) 
XL 
a3 . 
wat ($*) =y+sinz. (1.1.11) 


Equations (1.1.5)—(1.1.9) are called Legendre’s equation, Bessel’s equa- 
tion, Airy’s equation, Chebyshev’s equation, and Hermite’s equation, respec- 
tively. Each has a vast literature and a history reaching back hundreds of 
years. We shall touch on each of these equations later in the book. Equation 
(1.1.10) is the equation of exponential decay (or of biological growth). 


Math Nugget 


Adrien Marie Legendre (1752-1833) invented Legendre 
polynomials (the artifact for which he is best remembered) 
in the context of gravitational attraction of ellipsoids. Leg- 
endre was a fine French mathematician who suffered the 
misfortune of seeing most of his best work—in elliptic in- 
tegrals, number theory, and the method of least squares— 
superseded by the achievements of younger and abler men. 
For instance, he devoted forty years to the study of elliptic 
integrals, and his two-volume treatise on the subject had 
scarcely appeared in print before the discoveries of Abel 
and Jacobi revolutionized the field. Legendre was remark- 
able for the generous spirit with which he repeatedly wel- 
comed newer and better work that made his own obsolete. 
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Each of equations (1.1.5)—(1.1.9) is of second order, meaning that the 
highest derivative that appears is the second. Equation (1.1.10) is of first order. 
Equation (1.1.11) is of third order. Each equation is an ordinary differential 
equation, meaning that it involves a function of a single variable and the 
ordinary derivatives of that function. 

A partial differential equation is one involving a function of two or more 
variables, and in which the derivatives are partial derivatives. These equations 
are more subtle, and more difficult, than ordinary differential equations. We 
shall say something about partial differential equations in Chapter 10. 


Math Nugget 


Friedrich Wilhelm Bessel (1784-1846) was a distinguished 
German astronomer and an intimate friend of Gauss. The 
two corresponded for many years. Bessel was the first man 
to determine accurately the distance of a fixed star (the 
star 61 Cygni). In 1844 he discovered the binary (or twin) 
star Sirius. The companion star to Sirius has the size of a 
planet but the mass of a star; its density is many thousands 
of times the density of water. It was the first dead star to 
be discovered, and occupies a special place in the modern 
theory of stellar evolution. 


Dc 


1.2 A Taste of Ordinary Differential Equations 


In this section we look at two very simple examples to get a notion of what 
solutions to ordinary differential equations look like. 


EXAMPLE 1.2.1 Let us solve the differential equation 
y =a“. 


Solution: This is certainly an equation involving a function and some of its 
derivatives. It is plain to see, just intuitively, that a solution is given by 


ae 
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But that is not the only solution. In fact the general solution to this differential 
equation is 


x 
= —4+C 
yaa GO: 


for an arbitrary real constant C. i) 


So we see that the solution to this differential equation is an infinite family 
of functions. This is quite different from the situation when we were solving 
polynomials—when the solution set was a finite list of numbers. 

The differential equation that we are studying here is what we call first 
order—simply meaning that the highest derivative that appears in the equa- 
tion is one. So we expect there to be one free parameter in the solution, and 
indeed there is. 


EXAMPLE 1.2.2 Let us solve the differential equation 

y =y. 
Solution: Just by intuition, we see that y = e” is a solution of this ODE. But 
that is not the only solution. In fact 

y = Ce 


is the general solution. i) 


It is curious that the arbitrary constant in the previous example occurred 
additively, while the constant in this solution occurred multiplicatively. Why 
is that? 

Here is another way that we might have discovered the solution to the 
differential equation in Example 1.2.2. Write the problem in Leibniz notation 
as 


dy _ 

de 
Now manipulate the symbols to write this as 

dy = ydx 
or i 

oY = de 

y 


(At first it may seem odd to manipulate dy/dx as though it were a fraction. 
But we are simply using the shorthand dy = (dy/dx)dz.) 
We can integrate both sides of the last equality to obtain 


Iny=2+C 
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or 
y=e"-eF =D-e*. 


Thus we have rediscovered the solution to this ODE, and we have rather 
naturally discovered that the constant occurs multiplicatively. 


a 


1.3. The Nature of Solutions 


An ordinary differential equation of order n is an equation involving an un- 
known function f together with its derivatives 


df df d” f 
dx? dx?’ ""? dx” ° 
We might, in a more formal manner, express such an equation as 
af af d” f 
va ei | 
(«. Y, dx d dx2 v3 y] dx” 0 


How do we verify that a given function f is actually the solution of such an 
equation? 

The answer to this question is best understood in the context of concrete 
examples. 


EXAMPLE 1.3.1 Consider the differential equation 
y’ —5y' + 6y =0. 


Without saying how the solutions are actually found, verify that y;(x) = e?” 


and y2(x) = e?” are both solutions. 


Solution: To verify this assertion, we note that 


yf —5y, +6y. = 2-2-e7* 5-2-6? 46-e% 
= |[4-—10+6]-e7* 
= 0 


and 
ys — Sy, + Gyo = 3-3-8? —5-3- 627 +6- 63% 


= (9 -15+6]-e* 
=0. | 
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This process of verifying that a given function is a solution of the given 
differential equation is entirely new. The reader will want to practice and 
become accustomed to it. In the present instance, the reader may check that 
any function of the form 


y(x) = cye?* + cpe** (1.3.1) 


(where ci, cg are arbitrary constants) is also a solution of the differential 
equation. 

An important obverse consideration is this: When you are going through 
the procedure to solve a differential equation, how do you know when you 
are finished? The answer is that the solution process is complete when all 
derivatives have been eliminated from the equation. For then you will have 
y expressed in terms of x, at least implicitly. Thus you will have found the 
sought-after function or functions. 

For a large class of equations that we shall study in detail in the present 
book, we shall find a number of “independent” solutions equal to the order of 
the differential equation. Then we shall be able to form a so-called “general 
solution” by combining them as in (1.3.1). Picard’s existence and uniqueness 
theorem tells us that our general solution is complete—there are no other 
solutions. 

Sometimes the solution of a differential equation will be expressed as an 
implicitly defined function. An example is the equation 


dy y? 

4 = 1.3.2 

dx l-—ay’ ( ) 
which has solution 

sy =Inyt+c. (1.3.3) 


Note here that the hallmark of what we call a solution is that it has no 
derivatives in it: it is a direct formula, relating y (the dependent variable) 
to x (the independent variable). To verify that (1.3.3) is indeed a solution of 
(1.3.2), let us differentiate: 


d d 
aa = 
7 tY qin 4 
hence ‘ dul 
L-yto-—= uae 
y 
or d 1 
y —_ = 
dx ¢ *) " 
In conclusion, 
dy y? 
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as desired. 

One unifying feature of the two examples that we have now seen of ver- 
ifying solutions is this: When we solve an equation of order n, we expect 
n “independent solutions” (we shall have to say later just what this word 
“independent” means) and we expect n undetermined constants. In the first 
example, the equation was of order 2 and the undetermined constants were c1 
and cg. In the second example, the equation was of order 1 and the undeter- 
mined constant was c. 


Math Nugget 


Sir George Biddell Airy (1801-1892) was Astronomer Royal 
of England for many years. He was a hard-working, sys- 
tematic plodder whose sense of decorum almost deprived 
John Couch Adams of credit for discovering the planet Nep- 
tune. As a boy, Airy was notorious for his skill in designing 
peashooters. Although this may have been considered to be 
a notable start, and in spite of his later contributions to the 
theory of light (he was one of the first to identify the medical 
condition known as astigmatism), Airy seems to have devel- 
oped into an excessively practical sort of scientist who was 
obsessed with elaborate numerical calculations. He had little 
use for abstract scientific ideas. Nonetheless, Airy functions 
still play a prominent role in differential equations, special 
function theory, and mathematical physics. 


EXAMPLE 1.3.4 Verify that, for any choice of the constants A and B, the 
function 
y= 2° + Ae*+ Be-* 
is a solution of the differential equation 
y’ —y=2-2'. 


Solution: This solution set is typical of what we shall learn to find for a 
second-order linear equation. There are two free parameters in the solution 
(corresponding to the degree 2 of the equation). Now, if y = 27+ Ae*+ Be~*, 
then 

y’ = 2xa+ Ae” — Be * 


and 
y” =2+ Ae*>+ Be". 
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Hence 
y" == [2 + Ae” + Be*| — Es + Ae” + Be~*] =—9- 2 


as required. o 


EXAMPLE 1.3.5 One of the useful things that we do in this subject is to use an 
“initial condition” to specify a particular solution. Often this initial condition 
will arise from some physical consideration. 

For example, let us solve the (very simple) problem 


y =2y, y(O)=1. 


Solution: Of course the general solution of this differential equation is y = 
C - e?”, We seek the solution that has value 1 when x = 0. So we set 


70), 


This gives 
1=C. 


So we find the particular solution 


y= e*, | 


REMARK 1.3.6 One of the powerful and fascinating features of the study of 
differential equations is the geometric interpretation that we can often place on 
the solution of a problem. This connection is most apparent when we consider 
a first-order equation. Consider the equation 


dy _ 


an F(a,y). (1.3.6.1) 


We may think of equation (1.3.6.1) as assigning to each point (z, y) in the plane 
a slope dy/dx. For the purposes of drawing a picture, it is more convenient 
to think of the equation as assigning to the point (a, y) the vector (1, dy/dx). 
See Figure 1.1. Figure 1.2 illustrates how the differential equation 


dy 
Te 
assigns such a vector to each point in the plane. Figure 1.3 illustrates how the 
differential equation 
dy _ 
a 
assigns such a vector to each point in the plane. 
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FIGURE 1.1 
A vector field. 
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FIGURE 1.3 
The vect d for dy/dx = —y 


xX 
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Exercises 


1. Verify that the following functions (explicit or implicit) are solutions of 
the corresponding differential equations. 


(a) y=a2? +c y! = 2a 

(b) y=cx” xy’ = 2y 

(e) “y= e* $e yy’ =e" 

(d) y=ce™ y' =ky 

(e) y=csin2zx + co cos 2x y’ +4y =0 

(f) y=cie™ +c2e7*” y” —4y =0 

(g) y=ci sinh 2z + c2 cosh 2x y”’ —4y=0 

(h) y =arcsin zy sy ty=yVJV1—22y 
(i) y=axtanz xy’ =ut x+y? 

(j) a? = 2y"Iny as oe 

(k) y=2?-cx Qeyy =a +y? 

(I) ysete/a y tay! = 0*(y')? 

(m) y=ce¥/* y' = y?/(xy — 2”) 

(n) y+siny=a2 (ycosy—siny+2)y’=y 
(o) x+y =arctany 1+y?+y’y’ =0 


2. Find the general solution of each of the following differential equations. 


(a) yf=e™—2 (f) ay=1 

(b) y' = xe" (g) y’ =arcsine 

(c) (l4+2)y=2 (h) y’sing =1 

(d) (l+a?)y’ =a (i) (L+2°)y' =a 

(e) (1+a?)y’=arctane (j) (a? —3r42)y'=2 


3. For each of the following differential equations, find the particular solu- 
tion that satisfies the given initial condition. 


(a) y' =<2e* y=3 whenz=1 
(b) y’ =2sinzcosz y = 1 when x =0 
(c) y =Inz y = 0 when zt =e 
(d) (@?—1)y'=1 y = 0 when x = 2 
(e) a(x? —4)y' =1 y =0 when x= 1 
(f) (a@+1)(@? +1)y’ = 2a? +2 y = 1 when xz =0 


4. Show that the function 


2 [* 2 
y=e" / e* dt 
0 


is a solution of the differential equation y’ = 2ry + 1. 
5. For the differential equation 


y” — 5y'+ 4y =0, 


carry out the detailed calculations required to verify these assertions: 
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x 


(a) The functions y = e* and y = e*” are both solutions. 


(b) The function y = cie” +c2e*” is a solution for any choice of constants 
C1, C2. 
6. Verify that 2?y = ny+cis a solution of the differential equation dy/dx = 
2ay”/(1 — xy) for any choice of the constant c. 
7. For which values of m will the function y = ym = e”” be a solution of 
the differential equation 


mw 


ay!” + y" — Sy’ +2y=0? 


Find three such values m. Use the ideas in Exercise 5 to find a solution 
containing three arbitrary constants c1,c2,c3. 


(I 
1.4  Separable Equations 


In this section we shall encounter our first general class of equations with the 
properties that 


(i) we can immediately recognize members of this class of equations, 
and 


(ii) we have a simple and direct method for (in principle) solving such 
equations. 


This is the class of separable equations. 

A first-order ordinary differential equation is separable if it is possible, 
by elementary algebraic manipulation, to arrange the equation so that all the 
dependent variables (usually the y variable) are on one side of the equation 
and all the independent variables (usually the x variable) are on the other 
side of the equation. Let us learn the method by way of some examples. 


EXAMPLE 1.4.1 Solve the ordinary differential equation 
y’ = 2ry. 


Solution: In the method of separation of variables—which is a method for 
first-order equations only—it is useful to write the derivative using Leibniz 
notation. Thus we have 

dy 


— =2ry. 
dx a 


We rearrange this equation as 


d 
a = 2adz. 
y 
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d 
(It should be noted here that we use the shorthand dy to stand for = dx and 


we of course assume that y 4 0.) 
Now we can integrate both sides of the last displayed equation to obtain 


[oe [reac 
y 


We are fortunate in that both integrals are easily evaluated. We obtain 
In|y| = 2? +C. 


(It is important here that we include the constant of integration.) Thus 


We may rewrite this as 


y=e-e™ = De* . (1.4.1.1) 
| 


Notice two important features of our final representation for the solution: 


(i) We have re-expressed the constant e° as the positive constant D. 
We will even allow D to be negative, so we no longer need to worry 
about the absolute values around y. 


(ii) Our solution contains one free constant, as we may have anticipated 
since the differential equation is of order 1. 


We invite the reader to verify that the solution in equation (1.4.1.1) ac- 
tually satisfies the original differential equation. 


REMARK 1.4.2 Of course it would be foolish to expect that all first-order 
differential equations will be separable. For example, the equation 


dy 2 2 
a TY 


certainly is not separable. The property of being separable is rather special. 
But it is surprising that quite a few of the equations of mathematical physics 
turn out to be separable (as we shall see later in the book). 


EXAMPLE 1.4.3 Solve the differential equation 


zy’ = (1 — 22”) tany. 
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Solution: We first write the equation in Leibniz notation. Thus 


x 


Separating variables, we find that 


1 
cot y dy = € - 20) dx. 
x 


Applying the integral to both sides gives 


1 
footudy = [= -20dz 


In|sin y| = In|2|—2?+C. 


or 


Again note that we were careful to include a constant of integration. 


We may express our solution as 


2 
sin y = elne—2 +C 


or 5 
siny=D-xz-e* 


The result is : 
y =sin* (D-a-e* ) : 


15 


We invite the reader to verify that this is indeed a solution to the given dif- 


ferential equation. 


REMARK 1.4.4 Of course the technique of separable equations is one that is 
specifically designed for first-order equations. It makes no sense for second- 
order equations. Later in the book we shall learn techniques for reducing a 
second-order equation to a first-order; then it may happen that the separation- 


of-variables technique applies. 


a 


Exercises 


1. Use the method of separation of variables to solve each of these ordinary 


differential equations. 


(a) wv y’+y?=0 (f) xy’ = (1— 42?) tany 
(b) y' = 4ay (g) y’siny = 2° 
(c) y’+ytanz =0 (h) y'—ytanz =0 


(d) (1+2?)dy+(1 
y 


y)de=0 = (i) ayy’ =y-1 
(e) ylnydxr—axdy=0 


(i) 2y’—y'e? =0 
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2. For each of the following differential equations, find the particular solu- 
tion that satisfies the additional given property (called an initial condi- 


tion). 
(a) yy=ar+1 y =3 when x = 1 
(b) (dy/dx)x? = y y = 1 when « =0 

x 

y x 
(c) +m y y = 3 when x= 1 
(d) yy =x+2 y = 4 when «= 0 
(e) y= wy? y = 2 when x = -1 
(f) y/(+y)=1-2? y = —2 when x = —1 


3. For the differential equation 


make the substitution y’ = p, y’” = p’ to reduce the order. Then solve 
the new equation by separation of variables. Now resubstitute and find 
the solution y of the original equation. 


4. Use the method of Exercise 3 to solve the equation 
yy’ =2(1+2) 


subject to the initial conditions y(0) = 1, y’(0) = 2. 


a 


1.5 First-Order, Linear Equations 


Another class of differential equations that is easily recognized and readily 
solved (at least in principle), is that of first-order, linear equations. 
An equation is said to be first-order linear if it has the form 


y +a(x)y = W(x). (1.5.1) 


The “first-order” aspect is obvious: only first derivatives appear in the equa- 
tion. The “linear” aspect depends on the fact that the left-hand side involves a 
differential operator that acts linearly on the space of differentiable functions. 
Roughly speaking, a differential equation is linear if y and its derivatives are 
not multiplied together, not raised to powers, and do not occur as the argu- 
ments of functions. For now, the reader should simply accept that an equation 


1We throw in this caveat because it can happen, and frequently does happen, that we 
can write down integrals that represent solutions of our differential equation, but we are 
unable to evaluate those integrals. This is annoying, but there are numerical techniques that 
will address such an impasse. 
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of the form (1.5.1) is first-order linear, and that we will soon have a recipe for 
solving it. 

As usual, we learn this new method by proceeding directly to the exam- 
ples. 


EXAMPLE 1.5.2 Consider the differential equation 
y +2ry=2. 
Find a complete solution. 


Solution: First note that the equation definitely has the form (1.5.1) of a 
linear differential equation of first order. In particular, a(a) = 2x and b(x) = a. 
So we may proceed. 

We endeavor to multiply both sides of the equation by some function that 
will make each side readily integrable. It turns out that there is a trick that 
always works: You multiply both sides by el ocelae, 

Like many tricks, this one may seem unmotivated. But let us try it out 
and see how it works. Notice that a(x) = 2a so that 


fear = freae =". 


(At this point we could include a constant of integration, but it is not nec- 
essary.) Thus eS az) dx — gt”, Multiplying both sides of our equation by this 
factor gives 


2 


2 
‘+e .Iry=e" +x 


e -yt+e 


/ 
(<*-y) =r-e". 


It is the last step that is a bit tricky. For a first-order linear equation, it 
is guaranteed that, if we multiply through by e/ “() 4", then the left-hand side 
of the equation will end up being the derivative of [ef “(” 4 -y]. Now of course 
we integrate both sides of the equation: 


/ 
[(e-) d= fee de, 


We can perform both the integrations: on the left-hand side we simply ap- 
ply the fundamental theorem of calculus; on the right-hand side we do the 
integration as usual. The result is 


or 


or 
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Observe that, as we usually expect, the solution has one free constant (be- 
cause the original differential equation was of order 1). We invite the reader 
to check in detail that this solution actually satisfies the original differential 
equation. a 


REMARK 1.5.3 The last example illustrates a phenomenon that we shall en- 
counter repeatedly in this book. That is the idea of a “general solution.” 
Generally speaking, a differential equation will have an entire family of solu- 
tions. And, especially when the problem comes from physical considerations, 
we shall often have initial conditions that must be met by the solution. The 
family of solutions will depend on one or more parameters (in the last example 
there was one parameter C), and those parameters will often be determined 
by the initial conditions. 

We shall see as the book develops that the amount of freedom built into the 
family of solutions—that is, the number of degrees of freedom provided by the 
parameters—meshes very nicely with the number of initial conditions that fit 
the problem (in the last example, one initial condition would be appropriate). 
Thus we shall generally be able to solve uniquely for numerical values of 
the parameters. Picard’s Existence and Uniqueness Theorem gives a precise 
mathematical framework for the informal discussion in the present remark. 


Summary of the Method 


To solve a first-order linear equation 


y' + a(x)y = W(x) , 


multiply both sides of the equation by the “integrating fac- 
tor” ef (*) 4" and then integrate. 


EXAMPLE 1.5.4 Solve the differential equation 
ay! + xy = ae 


Solution: First observe that this equation is not in the standard form (equa- 
tion (1.5.1)) for first-order linear. We render it so by multiplying through by 
a factor of 1/x?. Thus the equation becomes 


eid 
Uae. 
x 
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Now a(x) = 1/z, f a(x) dx = In|a|, and eJ x) dx — |r], We multiply 
the differential equation through by this factor. In fact, in order to simplify 
the calculus, we shall restrict attention to x > 0. Thus we may eliminate the 
absolute value signs. 


Thus 


ry +y=2". 


Now, as is guaranteed by the theory, we may rewrite this equation as 


(2-y) =2. 


Applying the integral to both sides gives 


[ (eu) dee f2rae. 


Now, as usual, we may use the fundamental theorem of calculus on the 
left; and we may simply integrate on the right. The result is 


We finally find that our solution is 


- x? i C 
amet Ne 
You should plug this answer into the differential equation and check that it 
works. | 


TT 


Exercises 


1. Find the general solution of each of the following first-order, linear ordi- 
nary differential equations. 


(a) y’—ay=0 (Ec ricey 0. 

(b) a gee (g) sy'—3y=a 

(c) yty= To oe (h) (14+ 27) dy + 2ay dx = cot x dz 
(d) y ty=2re7* +2? (i) y' +ycotx = 2xrcscex 

(e) (2y— 2?) dx =axdy Gj) y-x+aycotxz+a2y’ =0 


2. For each of the following differential equations, find the particular solu- 
tion that satisfies the given initial data. 
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(a) y’-azy=0 y =3 when x = 1 
(b) y! — xy = bre” y=1whenz=1 
(c) xy’ +y= 32? y = 2 when x = 2 
(d) y'—(1/a)y=2? y=3whenz=1 
(e) y +4y=e* y = 0 when x = 0 
(f)  2?y' + ay = 2x y=1whenz#=1 


The equation 

4 Play =Q(a)y" 

dx 

is known as Bernoulli’s equation. It is linear when n = 0 or 1, otherwise 
not. In fact the equation can be reduced to a linear equation when n > 1 
by the change of variables z = y'~”. Use this method to solve each of 
the following equations. 


(a) wy’ ty=cxty? (c) adyt+ydzx = xy? dx 
(b) zyy' +y? = axcose (d) y! +ay = ay" 


The usual Leibniz notation dy/dx implies that x is the independent vari- 
able and y is the dependent variable. In solving a differential equation, it 
is sometimes useful to reverse the roles of the two variables. Treat each 
of the following equations by reversing the roles of y and a: 


(a) (e% —2zy)y’ = y? (c) ay’ = =a°(y—1)y’ 
(b) y—ay’ =y'y?e" (d) ees + 3f(y)f (y)@ = f'(y) 


We know from our solution technique that the general solution of a first- 
order linear equation is a family of curves of the form 


y=c- f(a) +9(2). 


Show, conversely, that the differential equation of any such family is linear 
and first order. 


Show that the differential equation y’ + Py = Qylny can be solved by 
the change of variables z = In y. Apply this method to solve the equation 


ry = Qe*y + ylny. 


One solution of the differential equation y’ sin 2x = 2y + 2cosz remains 
bounded as x — 7/2. Find this solution. 

A tank contains 10 gallons of brine in which 2 pounds of salt are dissolved. 
New brine containing 1 pound of salt per gallon is pumped into the tank 
at the rate of 3 gallons per minute. The mixture is stirred and drained 
off at the rate of 4 gallons per minute. Find the amount x = x(t) of salt 
in the tank at any time t. 

A tank contains 40 gallons of pure water. Brine with 3 pounds of salt 
per gallon flows in at the rate of 2 gallons per minute. The thoroughly 
stirred mixture then flows out at the rate of 3 gallons per minute. 
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(a) Find the amount of salt in the tank when the brine in it has been 
reduced to 20 gallons. 
(b) When is the amount of salt in the tank greatest? 
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1.6 Exact Equations 


A great many first-order equations may be written in the form 
M(a,y)dz+ N(z,y) dy =0. (1.6.1) 


This particular format is quite suggestive, for it brings to mind a family of 
curves. Namely, if it happens that there is a function f(z, y) so that 


of af 
= ae 1.6.2 
Ap Mand at (1.6.2) 


then we can rewrite the differential equation as 


of OF 4. 
ee ay (1.6.3) 


Of course the only way that such an equation can hold is if 


af _ 
Ox Oy 


And this entails that the function f be identically constant. In other words, 


f(z,y) =C. 


This last equation describes a family of curves: for each fixed value of 
C’, the equation expresses y implicitly as a function of x, and hence gives a 
curve. Refer to Figure 1.4 for an example. In later parts of this book we shall 
learn much from thinking of the set of solutions of a differential equation as a 
smoothly varying family of curves in the plane. 

The method of solution just outlined is called the method of exact equa- 
tions. It depends critically on being able to tell when an equation of the form 
(1.6.1) can be written in the form (1.6.3). This in turn begs the question of 
when (1.6.2) will hold. 

Fortunately, we learned in calculus a complete answer to this question. 
Let us review the key points. First note that, if it is the case that 


of of 
— = “= = 1.6.4 
ap Mand a N, (1.6.4) 
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a — 


FIGURE 1.4 
A family of curves defined by x? — y? = ¢. 


then we see (by differentiation) that 


0? f OM Org ON 
= —— and — = —. 
OyOu Oy OxOy Ox 


Since mixed partials of a smooth function may be taken in any order, we find 
that a necessary condition for condition (1.6.4) to hold is that 
OM _ON 
Oy Ox” 
We call (1.6.5) the exactness condition. This provides us with a useful test for 
when the method of exact equations will apply. 
It turns out that condition (1.6.5) is also sufficient—at least on a domain 
with no holes. We refer the reader to any good calculus book (see, for instance, 


[BLK]) for the details of this assertion. We shall use our worked examples to 
illustrate the point. 


(1.6.5) 


EXAMPLE 1.6.6 Use the method of exact equations to solve 


d 
= coty: aa 


2 dx 
Solution: First, we rearrange the equation as 
2esinydx + x? cosy dy = 0. 


Observe that the role of M(z,y) is played by 2xsiny and the role of 
N(za,y) is played by x? cosy. Next we see that 


OM 
Oy 


= 2x cosy = 


Ox 


1.6. EXACT EQUATIONS 23 


Thus our necessary condition (exactness) for the method of exact equations 
to work is satisfied. We shall soon see explicitly from our calculations that it 
is also sufficient. 

We seek a function f such that Of /Oxz = M(a,y) = 2xsiny and Of /Oy = 
N(a,y) = x? cosy. Let us begin by concentrating on the first of these: 


of =2zrsiny, 


Ox 


| haem [eesinyae, 
Ox 


The left-hand side of this equation may be evaluated with the fundamental 
theorem of calculus. Treating x and y as independent variables (which is part 
of this method), we can also compute the integral on the right. The result is 


f(a,y) =a? siny + d(y). (1.6.6.1) 


Now there is an important point that must be stressed. The reader should 
by now have expected a constant of integration to show up. But in fact our 
“constant of integration” is ¢(y). This is because our integral was with respect 
to x, and therefore our constant of integration should be the most general 
possible expression that does not depend on x. That, of course, would be a 
function of y. 

Now we differentiate both sides of (1.6.6.1) with respect to y to obtain 


hence 


N(2,y) = = =x" cosyt ¢'(y). 


But of course we already know that N(x, y) = x? cosy. The upshot is that 
(y) = 0 
or 
oy) =D, 
an ordinary constant. 
Plugging this information into equation (1.6.6.1) now yields that 
f(a,y) = 2? siny+ D. 


We stress that this is not the solution of the differential equation. Before you 
proceed, please review the outline of the method of exact equations that pre- 
ceded this example. Our job now is to set 


So 
x’ -siny+D=C 
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or 7" 
x? -siny=C, 
where C = C — D. 
This is in fact the solution of our differential equation, expressed implicitly. 
If we wish, we can solve for y in terms of x to obtain 


. C 
y=sn =. | 
x 


EXAMPLE 1.6.7 Use the method of exact equations to solve the differential 
equation 
y’ dx — x? dy =0. 


Solution: We first test the exactness condition: 


OM ON 
——=2 —22 = —. 
Oy uF Ox 
The exactness condition fails. As a result, this ordinary differential equation 
cannot be solved by the method of exact equations. a 


It is a fact that, even when a differential equation fails the “exact equations 
test,” it is always possible to multiply the equation through by an “integrating 
factor” so that it will pass the exact equations test. Unfortunately, it can be 
quite difficult to discover explicitly what that integrating factor might be. We 
shall learn more about the method of integrating factors later in the book. 


EXAMPLE 1.6.8 Use the method of exact equations to solve 
e¥ dx + (ae¥ + 2y) dy =0. 


Solution: First we check for exactness: 


OM _ o [et] =e = Ine 4.24) = SE 


Oy Oy Ox Ox 
The exactness condition is verified, so we can proceed to solve for f: 
ss = M=e 
hence 
f(x,y) =a: e¥ + oy) 
But then 
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And this last expression must equal N(x, y) = xe¥ + 2y. It follows that 


$'(y) = 2y 
or 
oy) =y? +D. 
Altogether, then, we conclude that 
f(x,y) =a-e ty +D. 


We must not forget the final step. The solution of the differential equation is 


f(z, y)=C 


or . 
g-lt+ys=O-D=C. 


This time we must content ourselves with the solution expressed implicitly, 
since it is not feasible to solve for y in terms of x (at least not in an elementary, 
closed form). a 


rr 


Exercises 


Determine which of the following equations, in Exercises 1-19, is exact. Solve those 
that are exact. 


1. («+2) dy+ydz=0 
y 
2. (sinxtany + 1)dx+coszsec? ydy = 0 
3. (y—a?)dr+ (x+y) dy =0 
4. (2y? — 4a +5) da = (4—2y + 4ay) dy 
5. (y+ycosxy) dx +(x +xcos xy) dy = 0 
6. cosxcos” ydx + 2sinzsin y cos y dy = 0 
7. (sinzsiny — xe”) dy = (e¥ + cos xcos y) dx 
8. —+sin © de + sin = ay =0 
y y y y 
9. (l+y)dx+(1—«2)dy=0 
10. (2ay? + ycos x) dx + (3x?y? + sin x) dy = 0 
= y tt 
12. (2ry* + sin y) dx + (4x7y? + x cosy) dy = 0 
13. y de + xdy +adxr=0 


1 — 2? y? 
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14. 22(1+ Vx? — y) dx = Vx? — ydy 


15. («lny+cay)dx+(ylnz + xy) dy =0 


16. (ev — esc yesc? x) dx + (Qrye¥” —cscycot ycot x) dy = 0 
17. (1+ y? sin 2x) dx — 2y cos? « dy = 0 
x dx y dy 

(x2 + y2)3/2 ° (a? + y?)3/2 
3 

19. 3a27(1+Iny) dx + (= a 2u) dy =0 
y 

20. Solve 


18. =% 


ydx —xdy 
(@+y)?_ 
as an exact equation by two different methods. Now reconcile the results. 
21. Solve 


+ dy = dx 


as an exact equation. Later on (Section 1.8) we shall learn that we may 
also solve this equation as a homogeneous equation. 


22. For each of the following equations, find the value of n for which the 
equation is exact. Then solve the equation for that value of n. 


(a) (2y" +na*y) dx + (2° + xy) dy = 0 
(b) (a+ ye?””) dx + nae?*¥ dy = 0 


TS 


1.7 Orthogonal Trajectories and Families 
of Curves 
We have already noted that it is useful to think of the collection of solutions of 


a first-order differential equation as a family of curves. Refer, for instance, to 
the last example of the preceding section. We solved the differential equation 


e¥ dx + (xe¥ + 2y) dy = 0 
and found the solution set 
a-et+y=C, (1.7.1) 


For each value of C , the equation describes a curve in the plane. 
Conversely, if we are given a family of curves in the plane, then we can 
produce a differential equation from which the curves all come. Consider the 


example of the family 
e+ y?=2Cz. (1.7.2) 


The reader can readily see that this is the family of all circles tangent to the 
y-axis at the origin (Figure 1.5). 
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FIGURE 1.5 
A family of circles. 


We may differentiate the equation with respect to x, thinking of y as a 
function of x, to obtain 


d 
Qa + Wy 5 = 20. 


Now the original equation (1.7.2) tells us that 


and we may equate the two expressions for the quantity 2C (the point being 
to eliminate the constant C). The result is 


d 2 
Depa oe =a2+ ae 
dx x 
or 5 5 i 
Y yer 
pa 1.7.3 
dx 2xry ( ) 


In summary, we see that we can pass back and forth between a differential 
equation and its family of solution curves. 

There is considerable interest, given a family F of curves, to find the cor- 
responding family G of curves that are orthogonal (or perpendicular) to those 
of F. For instance, if F represents the flow curves of an electric current, then G 
will be the equipotential curves for the flow. If we bear in mind that orthogo- 
nality of curves means orthogonality of their tangents, and that orthogonality 
of the tangent lines means simply that their slopes are negative reciprocals, 
then it becomes clear what we must do. 


EXAMPLE 1.7.4 Find the orthogonal trajectories to the family of curves 


o+y=C. 
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FIGURE 1.6 
Circles centered at the origin. 


Solution: First observe that we can differentiate the given equation to obtain 
d 
2u + Qy- eG 
da 


The constant C’ has disappeared, and we can take this to be the differential 
equation for the given family of curves (which in fact are all the circles centered 
at the origin—see Figure 1.6). 

We rewrite the differential equation as 


dz yo 
Now taking negative reciprocals, as indicated in the discussion right before 
this example, we obtain the new differential equation 


ey 
dx « 
for the family of orthogonal curves. 


We may easily separate variables to obtain 
1 1 
—dy=-—dz. 
y x 


Applying the integral to both sides yields 


1 1 
[oa= [a0 
y a 
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FIGURE 1.7 
Lines through the origin. 


or 
In|y| =Inj2|+C. 
With some algebra, this simplifies to 
ly| = Dia| 
or 
y=+De. 


The solution that we have found comes as no surprise: the orthogonal 
trajectories to the family of circles centered at the origin is the family of lines 
through the origin. See Figure 1.7. 


EXAMPLE 1.7.5 Find the family of orthogonal trajectories to the curves 
y = Cz”. 
Solution: We differentiate to find that 


dy 
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or 
_ 1s dy 
Qe dx’ 
But the original equation tells us that 


The point here is to eliminate C’, and we can do so by equating the two 
expressions for C’. Thus 
1 dy sy 
224 dx x 
or 
dy _ 2y 
dx x 


Then the family of orthogonal curves will satisfy 


This equation is easily solved by separation of variables. The solution is 


your C— 22/2. 


We leave the details to the interested reader. |_| 


Exercises 


1. Sketch each of the following families of curves. In each case, find the 
family of orthogonal trajectories, and add those to your sketch. 


(a) ry=c (d) r=c(1+cos6) 
(b) y=ce? (e) y=ce 
(c) r+y=c (f) «-y?=c 


2. What are the orthogonal trajectories of the family of curves y = cx‘? 
What are the orthogonal trajectories of the family of curves y = cx” for 
n a positive integer? Sketch both families of curves. How does the family 
of orthogonal trajectories change when n is increased? 

3. Sketch the family y® = 4c(a# +c) of all parabolas with axis the a-axis 
and focus at the origin. Find the differential equation of this family. 
Show that this differential equation is unaltered if dy/dx is replaced by 
—dx/dy. What conclusion can be drawn from this fact? 

4. In each of parts (a) through (f), find the family of curves that satisfy 
the given geometric condition (you should have six different answers for 
the six different parts of the problem): 
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(a) The part of the tangent cut off by the axes is bisected by the point 
of tangency. 

(b) The projection on the x-axis of the part of the normal between (x, y) 
and the z-axis has length 1. 

(c) The projection on the x-axis of the part of the tangent between (x, y) 
and the z-axis has length 1. 

(d) The part of the tangent between (x,y) and the z-axis is bisected by 
the y-axis. 

(e) The part of the normal between (x,y) and the y-axis is bisected by 
the x-axis. 

(f) The point (x,y) is equidistant from the origin and the point of in- 
tersection of the normal with the z-axis. 

5. A curve rises from the origin in the x-y plane into the first quadrant. The 
area under the curve from (0,0) to (a, y) is one third of the area of the 
rectangle with these points as opposite vertices. Find the equation of the 
curve. 

6. Find the differential equation of each of the following one-parameter fam- 
ilies of curves: 

(a) y=asin(x +c) 

(b) all circles through (1,0) and (—1,0) 

(c) all circles with centers on the line y = x and tangent to both axes 

(d) all lines tangent to the parabolas 7? = 4y (Hint: The slope of the 
tangent line at (2a,a”) is a. 


(e) all lines tangent to the unit circle 2? +y? =1 


7. Use your symbol manipulation software, such as Maple or Mathematica, 
to find the orthogonal trajectories to each of these families of curves: 


(a) y=sina + cx? 

(b) y=clnx+zaz, x>0 
Cos x 

(c) Us ia x>0 

(d) y=sinz+ccosz 


rr 


1.8 Homogeneous Equations 


The reader should be cautioned that the word “homogeneous” has two mean- 
ings in this subject (as mathematics is developed simultaneously by many 
people all over the world, and they do not always stop to cooperate on their 
choices of terminology). 

One usage, which we shall see later, is that an ordinary differential equa- 
tion is homogeneous when the right-hand side is zero; that is, there is no 
forcing term. 

The other usage will be relevant to the present section. It bears on the 
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“balance” of weight among the different variables. It turns out that a differ- 
ential equation in which the x and y variables have a balanced presence is 
amenable to a useful change of variables. That is what we are about to learn. 

First of all, a function g(x,y) of two variables is said to be homogeneous 
of degree a, for a a real number, if 


g(ta, ty) = t°g(a, y) for allt >0. 


As examples, consider: 


e Let g(x,y) = 2? + xy. Then g(tx,ty) = (ta)? + (ta) - (ty) = ta? + Pry = 
t? - g(x,y), so g is homogeneous of degree 2. 


e Let g(x,y) = sin|[xz/y]. Then g(ta, ty) = sin[(tx)/(ty)] = sin[x/y] 
= t°- g(x,y), so g is homogeneous of degree 0. 


e Let g(z,y) = Va? + y?. Then g(ta,ty) = (ta)? + (ty)? = tYa?+y? = 


t- g(x,y), so g is homogeneous of degree 1. 


If a function is not homogeneous in the sense just indicated, then we call it 
inhomogeneous. 
In case a differential equation has the form 


M(x, y) da + N(x,y) dy =0 


and M,N have the same degree of homogeneity, then it is possible to perform 
the change of variable z = y/x and make the equation separable (see Sec- 
tion 1.4). Of course we then have a well-understood method for solving the 
equation. 

The next examples will illustrate the method. 


EXAMPLE 1.8.1 Use the method of homogeneous equations to solve the equa- 
tion 
(a+ y) dx —(a—y)dy=0. 


Solution: This equation is not exact. However, observe that M(z,y) = x+y 
and N(x, y) = —(x—y) and each is homogeneous of degree 1. We thus rewrite 
the equation in the form 


dy _a+y 
dx x—y! 
Dividing numerator and denominator by 2, we finally have 
dy 1+#4 
= 7 1.8.1.1 
dx 1-4 ( ) 


The point of these manipulations is that the right-hand side is now plainly 
homogeneous of degree 0. We introduce the change of variable 


y 
== 1.8.1.2 
z=4 (1.8.1.2) 
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hence 
y= 2x 
and d d 
y Zz 
pee Beas 1.8.1. 
a z+2 a (1.8.1.3) 
Putting (1.8.1.2) and (1.8.1.3) into (1.8.1.1) gives 
dz 1l+z 
aa aE ~ lag 


Of course this may be rewritten as 


dz 1+2? 
" l-<z 

or 
1l-—<z dx 


14+ 2? ae 
Notice that we have separated the variables!! We apply the integral, and 
rewrite the left-hand side, to obtain 


<—-/4-/F 


1+ 22 1+ 22 x 


The integrals are easily evaluated, and we find that 
1 2 
arctan z — 5 nl +2°)=Inz+C. 


Now we return to our original notation by setting z = y/x. The result is 


1 2 
arctan = —— In ie =Inrz+C 
x 2 2 


or 


arctan ~ — In «/a? + a Ou 
x 


Thus we have expressed y implicitly as a function of «, all the derivatives are 
gone, and we have solved the differential equation. | 


EXAMPLE 1.8.2 Solve the differential equation 
zy’ = 2x + 3y. 


Solution: It is plain that the equation is first-order linear, and we encourage 
the reader to solve the equation by that method for practice and comparison 
purposes. Instead, developing the ideas of the present section, we shall use the 
method of homogeneous equations. 
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If we rewrite the equation as 
—(24 + 3y)dx+ady=0, 
then we see that each of M = —(2x+3y) and N = z is homogeneous of degree 
1. Thus we render the equation as 


dy _ 2z+3y _ 2+3(y/z) 
dx x 7 1 


The right-hand side is homogeneous of degree 0, as we expect. 
We set z = y/ax and dy/dx = z + x[dz/dz]. The result is 


= 943%) 
x 


d 
pe See a By. 
dx x 


The equation separates, as we anticipate, into 


dz dx 


Q2+22 a! 
This is easily integrated to yield 


1 
-InQl+z)=Imnx+C 


2 

or 

z= Dr? —-1. 
Resubstituting z = y/a gives 

fi = 

x 
hence 

y= Dz —<. | 
Exercises 


1. Verify that each of the following differential equations is homogeneous, 
and then solve it. 


(a) (a — 2y*) da + rydy =0 (f) (a@—y)dx (x+y) dy =0 
(b) ay’ — 3xy — 2y” =0 (g) zy’ = 2x — 6y 
(c) ay’ =3(a? +y7)- arctan 2 +ay (h) zy = /r?+y? 
. ¥) dy oo U ae eee 
(d) x (sin 4) Y= ysin 4 +0 (i) ay =y + 2zry 
x/ dx x . 4 ; 


(e) wy’ =yt2ce7¥/* (j)  (#° + y?) dx — zy? dy = 0 
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2. Use rectangular coordinates to find the orthogonal trajectories of the 
family of all circles tangent to the x-axis at the origin. 


3. (a) If ae ¥ bd then show that h and k can be chosen so that the substi- 
tution « = z—h, y = w —k reduces the equation 


dy _P ax+by+c 
dx dx+ey+f 


to a homogeneous equation. 


(b) If ae = bd then show that there is a substitution that reduces the 
equation in (a) to one in which the variables are separable. 


4. Solve each of the following differential equations. 


dy a«+t+yt+4 dy r+y-l1 
(a) = = ———_ (d) = = —*— 
dx «+4y+2 


(b) = = ——— (e) (2x + 3y — 1) dx 
—4(a +1)dy=0 


(c) (2x — 2y)dx+ (y—1)dy=0 


5. By making the substitution z = y/x” (equivalently y = zx”) and choos- 
ing a convenient value of n, show that the following differential equations 
can be transformed into equations with separable variables, and then 


solve them. 
dy _1—ay? dy _y—ay? 
(a) dx —-.2axy (c) dx «+a2y 
(b) dy = 2+ 3ay? 
dx — Aar?y 


6. Show that a straight line through the origin intersects all integral curves 
of a homogeneous equation at the same angle. 


7. Use your symbol manipulation software, such as Maple or Mathematica, 
to find solutions to each of the following homogeneous equations. (Note 
that these would be difficult to do by hand.) 

(a) y' =sin[y/a] — cos[y/z] 
(b) ede —2dy=0 
x 


dy _ x? — xy 
(c) dx _-y? cos(x/y) 
(a) y! = 2 - tanly/a] 


x 


a 


1.9 Integrating Factors 


We used a special type of integrating factor in Section 1.5 on first-order linear 
equations. At that time, we suggested that integrating factors may be applied 
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in some generality to the solution of first-order differential equations. The 
trick is in finding the integrating factor. 

In this section we shall discuss this matter in some detail, and indicate 
the uses and the limitations of the method of integrating factors. 

First let us illustrate the concept of integrating factor by way of a concrete 
example. The differential equation 


ydz + (x?y — 2) dy =0 (1.9.1) 


is plainly not exact, just because 0M/Oy = 1 while ON/Ox = 2xy — 1, and 
these are unequal. However, if we multiply equation (1.9.1) through by a factor 
of 1/x? then we obtain the equivalent equation 


1 
4 det (v-=) = 0, 
x x 


and this equation is exact (as the reader may easily verify by calculating 
OM/dy and ON/Ox). And of course we have a direct method (see Section 1.6) 
for solving such an exact equation. 

We call the function 1/x? in the last paragraph an integrating factor. It 
is obviously a matter of some interest to be able to find an integrating factor 
for any given first-order equation. So, given a differential equation 


M(x, y) dx + N(a,y)dy=0, 
we wish to find a function p(x, y) such that 
u(x, y)- M(x, y) dx + u(a,y)- N(a,y) dy =0 


is exact. This entails 
A(u-M) _ A(u-N) 


Oy Ox 
Writing this condition out, we find that 
OM Ou ON Ou 
— + M— = p— + N—. 
M Oy Oy Pog Ox 
This last equation may be rewritten as 


1 (wH - uot) _ OM AN 


nw Ox Ay) Oy Ow” 


Now we use the method of wishful thinking: we suppose not only that an 
integrating factor ~ exists, but in fact that one exists that only depends on 
the variable x (and not at all on y). Then the last equation reduces to 


ldu — 0M/dy— ON/dx 
udx N ; 
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Notice that the left-hand side of this new equation is a function of x only. 
Hence so is the right-hand side. Call the right-hand side g(x). Notice that g 
is something that we can always compute. 


Thus 
= oH _ g(x) 
pdx 2 
hence (in u) 
np) 


or 
Inu = J o(e)az. 


We conclude that, in case there is an integrating factor 4 that depends on x 
only, then 


ina 


where 
_ OM/dy — ON/0x 
ae 
can always be computed directly from the original differential equation. 
Of course the best way to understand a new method like this is to look 
at some examples. This we now do. 


EXAMPLE 1.9.2 Solve the differential equation 
(xy — 1) dx + (x? — xy) dy =0. 


Solution: You may plainly check that this equation is not exact. It is also 
not separable. So we shall seek an integrating factor that depends only on «. 
Now 
OM/dy—-ON/dx — [a] — [2a -y] —“+y 1 
£) = EF 


9) N x2 — xy —a(-2+y) x 


This g depends only on z, signaling that the methodology we just developed 
will actually work. 
We set 
p(x) = ed g(a) dx = ed —l/adx = e7 Ine = 1 : 
x 
This is our integrating factor. We multiply the original differential equation 
through by 1/2 to obtain 


(v- =) dx + (x—y) dy=0. 


The reader may check that this equation is certainly exact. We omit the details 
of solving this exact equation, since that technique was covered in Section 1.6. 
|_| 
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Of course the roles of y and x may be reversed in our reasoning for finding 
an integrating factor. In case the integrating factor 4. depends only on y (and 
not at all on x) then we set 


OM /dy — ON/Ox 
h(y) = — Mey ON/0 


and define 
u(y) = ef My) dy, 


EXAMPLE 1.9.3 Solve the differential equation 
y da + (2a — ye”) dy = 0. 
Solution: First observe that the equation is not exact as it stands. Second, 
OM/dy—ON/Ozr = 1-2 — -l 
N 2x—yey 2x — yey 
does not depend only on x. So instead we look at 


_OM/dy—ON/Oz 1-2 1 


M yoy? 
and this expression depends only on y. So it will be our h(y). We set 


by) = ed h(y) dy = ed Wy dy =y. 


Multiplying the differential equation through by u(y) = y, we obtain the 
new equation 
y? dx + (2ry — y?e¥) dy = 0. 


You may easily check that this new equation is exact, and then solve it by the 
method of Section 1.6. a 


We conclude this section by noting that the differential equation 
ry? dz + yx? dy = 0 
has the properties that 
e it is not exact; 
OM /dy — ON/O0a 
Ppa ca ca 
N 
OM /dy — ON/O0z 
Ce 
M 


Thus the method of the present section is not a panacea. We shall not always 
be able to find an integrating factor. Still, the technique has its uses. 


does not depend on z only; 


does not depend on y only. 
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Exercises 


1. Solve each of the following differential equations by finding an integrating 
factor. 
(a) (3a? — y?) dy — 2xydx = 0 
(b) (ay — 1) dx + (a? — xy) dy =0 
(c) «dy+ydz + 32°y* dy =0 
(d) e” dx + (e* cot y + 2ycscy) dy =0 
(e) («+ 2)sinydx + x cosy dy =0 
(f) ydx + (x — 2xy?) dy =0 
(g) (a + 3y”) dx + 2xy dy =0 
(h) yda + (2x — ye”) dy =0 
(i) (ylny — 2ay) dx + (x+y) dy =0 
(j) (y?+ayt+1)dxt+ (a? +ay+1)dy=0 
(k) (a? + xy?) dx + 3y? dy =0 
2. Show that if (0M/dy — ON/dx)/(Ny — Mz) is a function g(z) of the 
product z = zy, then 


paella 


is an integrating factor for the differential equation 
M(a,y)dx + N(a,y)dy =0. 
3. Under what circumstances will the differential equation 
M(a,y) dx + N(x, y) dy =0 


have an integrating factor that is a function of the sum z= a+ y? 
4. Solve the following differential equation by making the substitution z = 
y/«” (equivalently y = xz") and choosing a convenient value for n: 
dy 2 3 
Lil sat Sy ae re Ea 
dx x y x? 
5. Use your symbol manipulation software, such as Maple or Mathematica, 
to write a routine for finding the integrating factor for a given differential 
equation. 


ne 


1.10 Reduction of Order 


It is a fact that virtually any ordinary differential equation can be transformed 
to a first-order system of equations. This is, in effect, just a notational trick, 
but it emphasizes the centrality of first-order equations and systems. In the 
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present section, we shall learn how to reduce certain higher-order equations 
to first-order equations—ones which we can frequently solve. 

We begin by concentrating on differential equations of order 2 (and think- 
ing about how to reduce them to equations of order 1). In each differential 
equation in this section, x will be the independent variable and y the de- 
pendent variable. So a typical second-order equation will involve x,y, y',y”. 
The key to the success of each of the methods that we shall introduce in this 
section is that one of these four variables must be missing from the equation. 


1.10.1 Dependent Variable Missing 
In case the variable y is missing from our differential equation, we make the 
substitution y’ = p. This entails y” = p’. Thus the differential equation is 
reduced to first order. 
EXAMPLE 1.10.1 Solve the differential equation 

xy” _ y’ a5 3? 
using reduction of order. 


Solution: Notice that the dependent variable y is missing from the differential 
equation. We set y’ = p and y” = p’, so that the equation becomes 


xp’ —p = 3x7. 


Observe that this new equation is first-order linear. We think of x as the 
independent variable and p as the new dependent variable. 
We write the equation in standard form as 


We may solve this equation by using the integrating factor p(x) = ef ~/7 4 = 
1/ax. Thus 


1 1 
—p'—-—p=3 
xv x 
SO ; 
1 
x 
or 


Performing the integrations, we conclude that 


1 
—p=3r+C, 
x 
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hence 
p(x) = 327+ Cz. 


Now we recall that p = y’, so we make that substitution. The result is 
y =3a7+Cz2, 
hence C 
yao + >a" +D=2" + Ex’ +D. 


We invite the reader to confirm that this is the complete and general solution 
to the original differential equation. (Note that, since the original equation is 
second order, there are two undetermined constants in the solution.) a 


EXAMPLE 1.10.2 Find the solution of the differential equation 


Solution: We note that y is missing, so we make the substitution p = y’, 
p' =y". Thus the equation becomes 


p = xp! 
or d 
2 2 ap 
=7-—. 
= dx 


This equation is amenable to separation of variables. 
The result is 


dx dp 
a P 
which integrates to 
1 1 
——=>--+F 
x 
or 
228 
ame + Ex 


for some unknown constant E. We resubstitute p = y’ and write the equation 


as 
dip - 0 Ae al 1 Le 2 A 1 


dx 1+Bx 1l+&ce E 1+Ex E EF 1+£x’ 
Now we integrate to obtain finally that 


Ay 1 

is the general solution of the original differential equation. 
Note here that we have used our method of reduction of order to solve a 
nonlinear differential equation of second order. |_| 
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1.10.2 Independent Variable Missing 


In case the variable x is missing from our differential equation, we make the 
substitution y’ = p. This time the corresponding substitution for y” will be a 
bit different. To wit, 


» dp _ dpdy _ dp- 


~ dx dydx dy - 


This change of variable will reduce our differential equation to first order. 
In the reduced equation, we treat p as the dependent variable and y as the 
independent variable. 


EXAMPLE 1.10.3 Solve the differential equation 
y" ot k?y —0 
(where it is understood that & is an unknown real constant). 


Solution: We notice that the independent variable x is missing. So we make 
the substitution 


The equation then becomes 


In this new equation we can separate variables: 
pdp = —k*y dy 


hence, integrating, 


2 2 
Pp 2¥ 
— = —-k*~—+C 
5 5 + ; 


so that 


=+,/D — ky? = +k /E—y?. 


Now we resubstitute p = dy/dx to obtain 


d 

ae = tkh/E—y?. 

dx 

We can separate variables to obtain 
ara 
E-y? 
hence (integrating) 
sin fy ee =tket+F 
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or 
= sin(+ke Te F) 


thus 


Y= VEsin(tka =f F) . 


Now we apply the sum formula for sine to rewrite the last expression as 


y = VEsin(+ka) cos F + VE cos(£ka) sin F . 


A moment’s thought reveals that we may consolidate the constants and finally 
write our general solution of the differential equation as 


y = Asin(kx) + Bcos(ka) . a 


We shall learn in the next chapter a different, and perhaps more expe- 
ditious, method of attacking examples of the last type. It should be noted 
quite plainly in the last example, and also in some of the earlier examples of 
the section, that the method of reduction of order basically transforms the 
problem of solving one second-order equation to a new problem of solving two 
first-order equations. Examine each of the examples we have presented and 
see whether you can say what the two new equations are. 

In the next example, we shall solve a differential equation subject to an 
initial condition. This will be an important idea throughout the book. Solving 
a differential equation gives rise to a family of functions. Specifying an initial 
condition is a natural way to specialize down to a particular solution. In 
applications, these initial conditions will make good physical sense. 


EXAMPLE 1.10.4 Use the method of reduction of order to solve the differential 
equation 


with initial conditions y(0) = 0 and y’/(0) = 1. 


Solution: Noting that the dependent variable x is missing, we make the 
substitution 


So the equation becomes 


We of course may separate variables, so the equation becomes 


dp = e” dy. 
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This is easily integrated to give 
p=eF+C. 


Now we resubstitute p = y’ to find that 


y =e+C 
or F 

ED od. 

ees Cc. 


Because of the initial conditions [dy/dz](0) = 1 and y(0) = 0, we may conclude 
right away that 


1=e°+C 
hence that C' = 0. Thus our equation is 
dy 
SF _ ey 
dx “ 
or ' 
ate dz. 
ey 
This may be integrated to 
-e Y=ax4+D. 


Of course we can rewrite the equation finally as 
y=—In(-c+ E). 
Since y(0) = 0, we conclude that 
y(a) = —In(—a2 + 1) 


is the solution of our initial value problem. | 


TT ne 


Exercises 
1. Solve the following differential equations using the method of reduction 
of order. 
(a) yy" + (y')? =0 (e) yy” =14 (y')” 
(b) ay =y' + (y’) (f) wo) =0 
(c) y —k’'y=0 (g) ty +y =4e 


(d) ay” =y' + (y')? 
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FIGURE 1.8 
The hanging chain. 


2. Find the specified particular solution of each of the following equations. 
(a) (a? + 2y’)y” +2a2y’=0, y=1 andy’ =0 when x =0 
(b) yy” =y?y'+(y')?, y=—1/2 and y’ =1 when « =0 
(c) y’ =y'e’, y=Oandy’ =2 when rx =0 

3. Solve each of these differential equations using both methods of this sec- 
tion, and reconcile the results. 


(a) yy” =14+(y'" (b) y+’)? =1 


4. Inside the Earth, the force of gravity is proportional to the distance from 
the center. A hole is drilled through the Earth from pole to pole and a 
rock is dropped into the hole. This rock will fall all the way through the 
hole, pause at the other end, and return to its starting point. How long 
will the complete round trip take? 


( 
1.11 The Hanging Chain and Pursuit Curves 
1.11.1 The Hanging Chain 


Imagine a flexible steel chain, attached firmly at equal height at both ends, 
hanging under its own weight (see Figure 1.8). What shape will it describe as 
it hangs? 

This is a classical problem of mechanical engineering, and its analytical 
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FIGURE 1.9 
Analysis of the hanging chain. 


solution involves calculus, elementary physics, and differential equations. We 
describe it here. 

We analyze a portion of the chain between points A and B, as shown 
in Figure 1.9, where A is the lowest point of the chain and B = (2,y) is a 
variable point. 

We let 


e T, be the horizontal tension at A; 
e T> be the component of tension tangent to the chain at B; 
e w be the weight of the chain per unit of length. 


Here 7), 7>,w are numbers. Figure 1.10 exhibits these quantities. 

Notice that, if s is the length of the chain between two given points, 
then ws is the downward force of gravity on this portion of the chain; this 
is indicated in the figure. We use the symbol 6 to denote the angle that the 
tangent to the chain at B makes with the horizontal. 

By Newton’s first law we may equate horizontal components of force to 
obtain 

T, = Tocosé. (1.11.1) 


Likewise, we equate vertical components of force to obtain 
ws = Tosiné. (1.11.2) 


Dividing the right side of (1.11.2) by the right side of (1.11.1) and the left side 
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FIGURE 1.10 
The quantities T, and Tb. 


of (1.11.2) by the left side of (1.11.1) and equating gives 


Oo = tand. 
Ti 
Think of the hanging chain as the graph of a function: y is a function of 
x. Then y’ at B equals tan@ so we may rewrite the last equation as 
,_ ws 
y= Ty 
We can simplify this equation by a change of notation: set q = y’. Then we 


have 
w 


= T° 

If Az is an increment of x then Ag = q(a+Az)—q(z) is the corresponding 
increment of g and As = s(a + Az) — s(x) the increment in s. As Figure 1.11 
indicates, As is well approximated by 


= (1+) 


q(x) (x). (1.11.3) 


1/2 1/2 


As © ((Az)? + (y'Az)”) Ag = (1+¢)'/Az. 


Thus, from (1.11.3), we have 
w w 
Aq=—As= —(1+q@)¥?Az. 
ie a ae 


Dividing by Az and letting Az tend to zero gives the equation 


MY ey, (1.11.4) 
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[(Ax) +(y'Ax)’] 


: y'Ax 


FIGURE 1.11 
An increment of the chain. 


This may be rewritten as 


/ooeea!/e 
ater TJ 


It is trivial to perform the integration on the right side of the equation, and 
a little extra effort enables us to integrate the left side (use the substitution 
u = tany, or else use inverse hyperbolic trigonometric functions). Thus we 
obtain i 

sinh"! g = —a+C. 

inh~~ q Ti x 
We know that the chain has a horizontal tangent when x = 0 (this corresponds 
to the point A—Figure 1.10). Thus q(0) = y’(0) = 0. Substituting this into 
the last equation gives C' = 0. Thus our solution is 


or 


or 


Finally, we integrate this last equation to obtain 


T, W 
y(x) wy oo (= c) oe oe 
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where D is a constant of integration. The constant D can be determined from 
the height ho of the point A from the x-axis: 


hence 


Our hanging chain is thus completely described by the equation 


T T 
y(x) = — cosh (+) + (1 -_ =) : 
W Ty W 


This curve is called a catenary, from the Latin word for chain (catena). Cate- 
naries arise in a number of other physical problems. The St. Louis arch is in 
the shape of a catenary. 


Math Nugget 


Many of the special curves of classical mathematics arise in 
problems of mechanics. The tautochrone property of the cy- 
cloid curve (that a bead sliding down the curve will reach 
the bottom in the same time, no matter where on the curve 
it begins) was discovered by the great Dutch scientist Chris- 
tiaan Huygens (1629-1695). He published it in 1673 in his 
treatise on the theory of pendulum clocks, and it was well 
known to all European mathematicians at the end of the 
seventeenth century. When Johann Bernoulli published his 
discovery of the brachistochrone (that special curve connect- 
ing two points down which a bead will slide in the least 
possible time) in 1696, he expressed himself in the follow- 
ing exuberant language (of course, as was the custom of the 
time, he wrote in Latin): “With justice we admire Huygens 
because he first discovered that a heavy particle falls down 
along a common cycloid in the same time no matter from 
what point on the cycloid it begins its motion. But you will 
be petrified with astonishment when I say that precisely this 
cycloid, the tautochrone of Huygens, is our required brachis- 
tochrone.” 
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FIGURE 1.12 
A tractrix. 


1.11.2 Pursuit Curves 


A submarine speeds across the ocean bottom in a particular path, and a de- 
stroyer at a remote location decides to engage in pursuit. What path does the 
destroyer follow? Problems of this type are of interest in a variety of applica- 
tions. We examine a few examples. The first one is purely mathematical, and 
devoid of “real-world” trappings. 


EXAMPLE 1.11.1 A point P is dragged along the z-y plane by a string PT of 
fixed length a. If T begins at the origin and moves along the positive y-axis, 
and if P starts at the point (a,0), then what is the path of P? 


Solution: The curve described by the motion of P is called, in the classical 
literature, a tractriz (from the Latin tractum, meaning “drag”). Figure 1.12 
exhibits the salient features of the problem. 
Observe that we can calculate the slope of the pursuit curve at the point 
P in two ways: (i) as the derivative of y with respect to x and (ii) as the ratio 
of sides of the relevant triangle. This leads to the equation 
dy _ a 


dx x 
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This is a separable, first-order differential equation. We write 
let age 
dy = — | ———dz. 
x 


Performing the integrations (the right-hand side requires the trigonometric 
substitution x = asin a), we find that 


2 72 
pan (S4YE=2) _ Yama +e 


is the equation of the tractrix.? iz 


EXAMPLE 1.11.2 A rabbit begins at the origin and runs up the y-axis with 
speed a feet per second. At the same time, a dog runs at speed b from the 
point (c,0) in pursuit of the rabbit. What is the path of the dog? 


Solution: At time t, measured from the instant both the rabbit and the dog 
start, the rabbit will be at the point R = (0,at) and the dog at D = (2, y). 
We wish to solve for y as a function of x. Refer to Figure 1.13. 

The premise of a pursuit analysis is that the line through D and R is 
tangent to the path—that is, the dog will always run straight at the rabbit. 
This immediately gives the differential equation 

dy _y—at 


dx x 


This equation is a bit unusual for us, since x and y are both unknown 
functions of t. First, we rewrite the equation as 


xy’ —y=—at. 


(Here the ’ on y stands for differentiation in x.) 
We differentiate this equation with respect to x, which gives 


Since s is arc length along the path of the dog, it follows that ds/dt = b. Hence 


dt dt ds 1 i+G) 
di ds dab oes 
?This curve is of considerable interest in other parts of mathematics. If it is rotated 
about the y-axis then the result is a surface that gives a model for non-Euclidean geometry. 
The surface is called a pseudosphere in differential geometry. It is a surface of constant 
negative curvature (as opposed to a traditional sphere, which is a surface of constant positive 
curvature). 


52 CHAPTER 1: WHAT IS A DIFFERENTIAL EQUATION? 


FIGURE 1.13 
A pursuit curve. 
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here the minus sign appears because s decreases when x increases (see Figure 
1.13). Of course we use the familiar expression for the derivative of arc length. 
Combining the last two displayed equations gives 


ny" =F Vit WP. 


For convenience, we set k = a/b, y’ = p, and y” = dp/dz (the latter two 
substitutions being one of our standard reduction of order techniques). Thus 
we have 

dp dx 


Viep es 


Now we may integrate, using the condition p= 0 when x = c. The result is 


In (p+ VI+P) =In (*)’ : 


When we solve for p, we find that 


In order to continue the analysis, we need to know something about the 
relative sizes of a and b. Suppose, for example, that a < b (so k < 1), meaning 
that the dog will certainly catch the rabbit. Then we can integrate the last 
equation to obtain 


Wo HES) ape 


Since y = 0 when x =, we find that D = ck. Of course the dog catches 
the rabbit when x = 0. Since both exponents on x are positive, we can set 
x = 0 and solve for y to obtain y = ck as the point at which the dog and the 
rabbit meet. | 


We invite the reader to consider what happens in this last example when 
a= band hence k = 1. 


SS 


Exercises 


1. Refer to our discussion of the shape of a hanging chain. Show that the 
tension T at an arbitrary point (x,y) on the chain is given by wy. 
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2. If the hanging chain supports a load of horizontal density L(a), then 
what differential equation should be used in place of (1.11.4)? 

3. What is the shape of a cable of negligible density (so that w = 0) that 
supports a bridge of constant horizontal density given by L(x) = Lo? 

4. Ifthe length of any small portion of an elastic cable of uniform density 
is proportional to the tension in it, then show that it assumes the shape 
of a parabola when hanging under its own weight. 

5. A curtain is made by hanging thin rods from a cord of negligible density. 
If the rods are close together and equally spaced horizontally, and if the 
bottom of the curtain is trimmed so that it is horizontal, then what is 
the shape of the cord? 

6. What curve lying above the z-axis has the property that the length of 
the arc joining any two points on it is proportional to the area under that 
arc? 

7. Show that the tractrix discussed in Example 1.11.1 is orthogonal to the 
lower half of each circle with radius a and center on the positive y-axis. 

8. (a) In Example 1.11.2, assume that a < b (so that k < 1) and find y as 

a function of xz. How far does the rabbit run before the dog catches 
him? 

(b) Assume now that a = 6, and find y as a function of x. How close 
does the dog come to the rabbit? 


a 


1.12 Electrical Circuits 


We have alluded elsewhere in the book to the fact that our analyses of vibrat- 
ing springs and other mechanical phenomena are analogous to the situation 
for electrical circuits. Now we shall examine this matter in some detail. 

We consider the flow of electricity in the simple electrical circuit exhibited 
in Figure 1.14. The elements that we wish to note are these: 


A. A source of electromotive force (emf) E—perhaps a battery or 
generator—which drives an electric charge and produces a current 
I. Depending on the nature of the source, E may be a constant or 
a function of time. 


B. A resistor of resistance R, which opposes the current by producing 
a drop in emf of magnitude 


Erp=RI. 


This equation is called Ohm’s law. 
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FIGURE 1.14 
A simple electric circuit. 


Math Nugget 


Georg Simon Ohm (1787-1854) was a German physicist 
whose only significant contribution to science was his discov- 
ery of the law that now bears his name. When he announced 
it in 1827, it seemed too good to be true; sadly, it was not 
generally believed. Ohm was, as a consequence, deemed to 


be unreliable. He was subsequently so badly treated that he 
resigned his professorship at Cologne and lived for several 
years in obscurity and poverty. Ultimately, it was recognized 
that Ohm was right all along. So Ohm was vindicated. One 
of Ohm’s students in Cologne was Peter Dirichlet, who later 
became one of the most distinguished German mathemati- 
cians of the nineteenth century. 


C. An inductor of inductance L, which opposes any change in the cur- 
rent by producing a drop in emf of magnitude 


dl 


| ene Eran 
E dt 
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D. A capacitor (or condenser) of capacitance C, which stores the charge 
Q. The charge accumulated by the capacitor resists the inflow of 
additional charge, and the drop in emf arising in this way is 


Furthermore, since the current is the rate of flow of charge, and 
hence the rate at which charge builds up on the capacitor, we have 


dQ 
I — “dt . 

Those unfamiliar with the theory of electricity may find it helpful to 
draw an analogy here between the current J and the rate of flow of water 
in a pipe. The electromotive force EF plays the role of a pump producing 
pressure (voltage) that causes the water to flow. The resistance R is analogous 
to friction in the pipe—which opposes the flow by producing a drop in the 
pressure. The inductance L is a sort of inertia that opposes any change in 
flow by producing a drop in pressure if the flow is increasing and an increase 
in pressure if the flow is decreasing. To understand this last point, think of 
a cylindrical water storage tank that the liquid enters through a hole in the 
bottom. The deeper the water in the tank (Q), the harder it is to pump new 
water in; and the larger the base of the tank (C) for a given quantity of stored 
water, the shallower is the water in the tank and the easier to pump in new 
water. 

These four circuit elements act together according to Kirchhoff’s Law, 
which states that the algebraic sum of the electromotive forces around a closed 
circuit is zero. This physical principle yields 


E-— Ep- Ey, -Ec=0 


or 


dl 1 
E-— RI-L—- =—Q=0 
dt Ge , 
which we rewrite in the form 
dl 1 
L— I+s=Q=E. 1.12.1 
qth t+ Ge ( ) 


We may perform our analysis by regarding either the current J or the 
charge Q as the dependent variable (obviously time t will be the independent 
variable). 


e In the first instance, we shall eliminate the variable Q from (1.12.1) by 
differentiating the equation with respect to t and replacing dQ/dt by I (since 
current is indeed the rate of change of charge). The result is 


aI I 1. dE 
[cay oer gael 
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e In the second instance, we shall eliminate the I by replacing it by dQ/dt. 
The result is 
dQ dQ sil 


Lag +R + Ga. (1.12.2) 


Both these ordinary differential equations are second-order, linear with 
constant coefficients. We shall study these in detail in Section 2.1. For now, 
in order to use the techniques we have already learned, we assume that our 
system has no capacitor present. Then, integrating, the equation becomes 


Lo +RI= E. (1.12.3) 


EXAMPLE 1.12.1 Solve equation (1.12.3) when an initial current Ip is flowing 
and a constant emf Eo is impressed on the circuit at time t = 0. 


Solution: For ¢ > 0 our equation is 


dI 
| aang 2 eae om 
a 


We can separate variables to obtain 


dl 1 


ee 
E=—RI 


We integrate and use the initial condition (0) = Jp to obtain 
R 
In(Eo — RI) = =a In(£o — Ro), 


hence 


We have learned that the current J consists of a steady-state component 
Eo/R and a transient component (Ip — Eo/R)e~**/" that approaches zero as 
t — +oo. Consequently, Ohm’s law Ey = RI is nearly true for t large. We 
also note that, if Jo = 0, then 


I pee 


if instead Ep = 0, then I = Ipe~**/". 
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TT 


Exercises 


1. In Example 1.12.1, with Jo = 0 and Eo ¥ 0, show that the current in the 
circuit builds up to half its theoretical maximum in (Ln 2)/R seconds. 
2. Solve equation (1.12.3) for the case in which the circuit has an initial 
current Jo and the emf impressed at time t = 0 is given by 
(a) E= Eoe"™ 
(b) E = Epsinwt 
3. Consider a circuit described by equation (1.12.3) and show that 
(a) Ohm’s law is satisfied whenever the current is at a maximum or 
minimum. 
(b) The emf is increasing when the current is at a minimum and decreas- 
ing when it is at a maximum. 
4. If L = 0 in equation (1.12.2) and if Q = 0 when ¢ = 0, then find the 
charge buildup Q = Q(t) on the capacitor in each of the following cases: 
(a) FE is aconstant Eo 
(b) E= Eve 
(c) E = Eqcoswt 
5. Use equation (1.12.1) with R = 0 and EF = 0 to find Q = Q(t) and J = 
I(t) for the discharge of a capacitor through an inductor of inductance 
L, with initial conditions Q = Qo and J = 0 when t = 0. 


a 


1.13 The Design of a Dialysis Machine 


The purpose of the kidneys is to filter out waste from the blood. When the 
kidneys malfunction, the waste material can build up to dangerous levels and 
be poisonous to the system. Doctors will use a kidney dialysis machine (or 
dialyzer) to assist the kidneys in the cleansing process. 

How does the dialyzer work? Blood flows from the patient’s body into the 
machine. There is a cleansing fluid, called the dialyzate, that flows through the 
machine in the opposite direction to the blood. The blood and the dialyzate 
are separated by a semi-permeable membrane. See Figure 1.15. 

The membrane in the dialyzer has minute pores which will not allow the 
passage of blood but will allow the passage of the waste matter (which has 
much smaller molecules). The design of the dialysis machine concerns the flow 
rate of the waste material through the membrane. That flow is determined by 
the differences in concentration (of the waste material) on either side of the 
membrane. Of course the flow is from high concentration to low concentration. 
Refer again to the figure. 

Of course the physician and the patient care about the rate at which the 
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FIGURE 1.15 
A dialysis machine. 


waste material is removed. This rate will depend on (i) the flow rate of blood 
through the dialyzer, (ii) the flow rate of dialyzate through the dialyzer, (iii) 
the capacity of the dialyzer, and (iv) the permeability of the membrane. 

It is convenient (and plausible) for our analysis here to take the capacity of 
the dialyzer and the permeability of the membrane to be fixed constants. Our 
analysis will focus on the dependence of the removal rate (of the waste) on the 
flow rate. We let x denote the horizontal position in the dialyzer (Figure 1.15). 
Our analysis centers on a small cross section of the dialyzer from position x 
to position to x + Az. We refer to the cross section of the total flow pictured 
in Figure 1.16 as an “element” of the flow. 

Clearly, from everything we have said so far, the most important variables 
for our analysis are the concentration of waste in the blood (call this quantity 
p(a)) and the concentration of waste in the dialyzate (call this quantity q(x)). 
There is in fact a standard physical law governing the passage of waste material 
through the membrane. This is Fick’s Law. The enunciation is: 


The amount of material passing through the membrane is proportional to 
the difference in concentration. 


Let us examine Figure 1.16 in order to understand the movement of con- 
centration of waste. The difference in concentration across ae (as one moves 
from the upper half of the figure to the lower half) is p(a) — q(x); therefore 
the transfer of waste mass through a section of the membrane of width 1 and 
length Az from blood solution to dialyzate solution in unit time is approxi- 
mately 


k|[p(x) — q(x)]- Aa. 
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FIGURE 1.16 
Cross section of the flow. 


The constant of proportionality & is independent of «. Now we must consider 
the mass change in the “element” a(@Ce in unit time. Now 


mass flow acrossay _ mass passing through mass flow across (36 


into element membrane 76 out of element. 


Let Fg denote the constant rate of flow of blood through the dialyzer. 
Then we may express this relationship in mathematical language as 


Fp - p(x) = k[p(x) — q(2)] A + Fe: p(e+ Az). 
Rearranging this equation, we have 


Py: (MERE APO) —tlple) — ato). 


Now it is natural to let Ax — 0 to obtain the differential equation 


dp 
Fez, =~ kp 4): (1.12.4) 


This last analysis, which led to equation (1.12.4), was based on an exam- 
ination of the flow of the blood. We may perform a similar study of the flow 
of the dialyzate to obtain 


dq 
-For. =k(p—q) (1.12.5) 


(note that the presence of the minus sign comes from the fact that the blood 
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flows in the opposite direction as the dialyzate). Of course F’p is the flow rate 
of dialyzate through the machine. 
Now we add equations (1.12.4) and (1.12.5) to obtain 


dp dq k k 


de a F, D+ ze & q)- 


Notice that p and q occur in this equation in an antisymmetric manner. Thus 
it is advantageous to make the substitution r = p— q. We finally obtain 


al =-ar, (1.12.6) 
dx 


where a= k/Fp—k/Fp. 
This equation is easily solved with separation of variables. The result is 


r(a) = Ae~*”, (1.12.7) 


where A is an arbitrary constant. Of course we wish to relate this solution to 
p and to q. Look again at equation (1.12.4). We see that 


Integration yields 


k 
p=B+——-e™, (1.12.8) 
a 


where B is an arbitrary constant. We now combine (1.12.7) and (1.12.8), 
recalling that r = p— q = Ae~°*, to obtain that 


kA 
are 


ar 


i Raee: een 


Finally, we must consider the initial conditions in the problem. We suppose 
that the blood has initial waste concentration po and the dialyzate has initial 
waste concentration 0. Thus 


p = po atxr=0 
q = 0 atr=L. 


Here L is the length of the dialyzer machine. Some tedious algebra, applied 
in a by-now-familiar manner, finally tells us that 


is (e-*"/ Fp) = (€-* / Fs) 
p(x) = Po (eR ee) 

ey GE Pehle 
ae Gaoeon 


These two equations represent a definitive analysis of the concentrations 
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of waste in the blood and in the dialyzate. To interpret these, we observe that 
the amount of waste removed from the blood in unit time is 


L L 
| k[p(a) —q(a)|)du = —-Fp — dx by (1.12.4) 


= —Fs|po — p(L)). 
Those who design dialyzers focus their attention on the “clearance” C1, 
which is defined to be 


Cl= = ipo ~p(L)]. 


Our equations for p and q yield, after some calculation, that 


Cl=F Lae 
~ oP (hyl bee ee 
Here 


kL 
aL = —(1 —Fp/Fp). 
FB 


The actual design of a dialyzer would entail testing these theoretical re- 
sults against experimental data, taking into account factors like the variation 
of k with x, the depth of the channels, and variations in the membrane. 


TE 


Problems for Review and Discovery 


A. Drill Exercises 


1. Find the general solution to each of the following differential equations. 
(a) xy! +y=a 
(b) 2?y’+y=2? 


ody _ 
(c) zr dx 
(d) seer - SY = secy 
(e:) wf te 
dx x2*—y? 
dy «+2y 
f) — = —— 
) dx 2w-y 


(g) 2cydx +27 dy=0 
(h) —sinzsiny dz + cosxcosy dy = 0 


PROBLEMS FOR REVIEW AND DISCOVERY 


2. 


Solve each of the following initial value problems. 


(a) zy’-y=2¢, y(0)=1 
(b). ay" =2y S30". yl) =2 


() ~PH=2, y-1)=3 
(d) ever H = esey, y(n/2) =1 
dy «ty 


(e) B= 4, wat 
dy x? + 2Qy? 

f) “2 22 14d, =i 

© 2-554) 


(g) 2ecosydr—x’sinydy=0, y(1)=1 
1 x 

h) —dx——dy=0, y(0)=2 

(h) ji P (0) 


Find the orthogonal trajectories to the family of curves y = c(x? + 1). 
Use the method of reduction of order to solve each of the following dif- 
ferential equations. 

(a) y-y”—(y')? =0 

(b) xy” = y! — 2y')? 

(c) yy" +y'=0 

(d) xy” — 3y’ = 5x 


B. Challenge Problems 


1. 


A tank contains 50 gallons of brine in which 25 pounds of salt are dis- 
solved. Beginning at time t — 0, water runs into this tank at the rate of 
2 gallons per minute; the mixture flows out at the same rate through a 
second tank initially containing 50 gallons of pure water. When will the 
second tank contain the greatest amount of salt? 

A natural extension of the first-order linear equation 


y' = p(x) + a(x)y 
is the Riccati equation 
y = p(x) +4(x)y + r(x)y’. 


In general, this equation cannot be explicitly solved by elementary meth- 
ods. However, if a particular solution yi(x) is known, then the general 
solution has the form 


y(x) = w(x) + 2(2), 
where z(x) is the general solution of the associated Bernoulli equation 
2’ — (q+ 2ryi)z=rz’. 


Prove this assertion, and use this set of techniques to find the general 
solution of the equation 


r_ sy 


y= italy — 2°. (*) 


[Hint: The equation (*) has yi(a) = x as a particular solution.] 
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3. The propagation of a single act in a large population (for instance, buy- 
ing a Lexus rather than a Cadillac) often depends partly on external 
circumstances (e.g., price, quality, and frequency-of-repair records) and 
partly on a human tendency to imitate other people who have already 
performed the same act. In this case the rate of increase of the propor- 
tion y(t) of people who have performed the act can be expressed by the 
formula 

dy 

He A Ws) + Ty], (#*) 
where s(t) measures the external stimulus and J is a constant called the 
imitation coefficient. 


(a) Notice that (**) is a Riccati equation (see the last exercise) and that 
y = 1 is a particular solution. Use the result of the last exercise to 
find the Bernoulli equation satisfied by z(t). 

(b) Find y(t) for the case in which the external stimulus increases 
steadily with time, so that s(t) = at for a positive constant a. Leave 
your answer in the form of an integral. 

4. If Riccati’s equation from Exercise 2 above has a known solution y;(x), 
then show that the general solution has the form of the one-parameter 
family of curves 

_ cf(x) + 9(@) 


~ eF(x) + G(a2)° 


Show, conversely, that the differential equation of any one-parameter fam- 
ily of this form is a Riccati equation. 


5. It begins to snow at a steady rate some time in the morning. A snow 
plow begins plowing at a steady rate at noon. The plow clears twice as 
much area in the first hour as it does in the second hour. When did it 
start snowing? 


C. Problems for Discussion and Exploration 


1. A rabbit starts at the origin and runs up the right branch of the parabola 
y = x’ with speed a. At the same time a dog, running with speed 8, starts 
at the point (c,0) and pursues the rabbit. Write a differential equation 
for the path of the dog. 


2. Consider the initial value problem 


dy __sin(ay) 
dx 1+a?+y?" 


This equation cannot be solved by any of the methods presented in this 
chapter. However, we can obtain some information about solutions by 
using other methods. 


(a) On a large sheet of graph paper draw arrows indicating the direction 
of the curve at a large number of points. For example, at the point 
(1,1) the equation tells us that 


dy _ sinl 
de 3 
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Draw a little arrow with base at the point (1,1) indicating that the 
curve is moving in the indicated direction. 


Do the same at many other points. Connect these arrows with “flow 
lines.” (There will be many distinct flow lines, corresponding to dif- 
ferent initial conditions.) Thus you obtain a family of curves, repre- 
senting the different solutions of the differential equation. 

(b) What can you say about the nature of the flow lines that you ob- 
tained in part (a)? Are they curves that you can recognize? Are they 
polynomial curves? Exponential curves? 

(c) What does your answer to part (b) tell you about this problem? 


3. Suppose that the function F(x, y) is continuously differentiable (i-e., con- 
tinuous with continuous first derivatives). Show that the initial value 
problem 

dy 
“J _ F(a2,y), y(0)= 
Fe TE (t¥), yO) = Yo 


has at most one solution in a neighborhood of the origin. 


Taylor & Francis 
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Second-Order Linear Equations 


e Second-order linear equations 

e The nature of solutions of second-order linear equations 
e General solutions 

e Undetermined coefficients 

e Variation of parameters 

e Use of a known solution 

e Vibrations and oscillations 

e Electrical current 

e Newton’s law of gravitation 

e Kepler’s laws 


e Higher-order equations 


a 


2.1 Second-Order Linear Equations with Constant Co- 
efficients 


Second-order linear equations are important because (considering Newton’s 
second law) they arise frequently in engineering and physics. For instance, 
acceleration is given by the second derivative, and force is mass times accel- 
eration. 

In this section we learn about second-order linear equations with constant 
coefficients. The “linear” attribute means, just as it did in the first-order situa- 
tion, that the unknown function and its derivatives are not multiplied together, 
are not raised to powers, and are not the arguments of other functions. So, 
for example, 

y” — 3y' + by = 0 


67 
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is second-order linear while 

sin(y”) — y’ +5y =0 
and 

yy” +4y' + 3y=0 


are not. 

The “constant coefficient” attribute means that the coefficients in the 
equation are not functions—they are constants. Thus a second-order linear 
equation with constant coefficient will have the form 


ay” + by’ +cy=d, (2.1.1) 


where a,b, c,d are constants. 

We in fact begin with the homogeneous case; this is the situation in which 
d = 0. We solve equation (2.1.1) by a process of organized guessing: any 
solution of (2.1.1) will be a function that cancels with its derivatives. Thus it 
is a function that is similar in form to its derivatives. Certainly exponentials 
fit this description. Thus we guess a solution of the form 


yre 


Plugging this guess into (2.1.1) gives 


1 / 
a(er) + 6(e*) + o(e*) =0. 
Calculating the derivatives, we find that 
are" + bre™™ + ce™ =0 
or 
[ar? + br +c]-e™™ =0. 


Of course the exponential never vanishes. Thus this last equation can only 
be true (for all 2) if 
ar’? +br+c=0. 


This is just a quadratic equation (called the associated polynomial equation),+ 
and we may solve it using the quadratic formula. This process will lead to our 


solution set. 
EXAMPLE 2.1.2 Solve the differential equation 


y” — 5y' + 4y =0. 


1Some texts will call this the characteristic polynomial, although that terminology has 
other meanings in mathematics. 
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Solution: Following the paradigm just outlined, we guess a solution of the 
form y = e’”. This leads to the quadratic equation for r given by 


r?—5r+4=0. 
Of course this factors directly to 
(r—1)(r —4) =0, 


sor=1,4. 
Thus e” and e*” are solutions to the differential equation (you should 
check this assertion for yourself). A general solution is given by 


y=A-e +B-e*, (2.1.2.1) 


where A and B are arbitrary constants. The reader may check that any func- 
tion of the form (2.1.2.1) solves the original differential equation. Observe that 
our general solution has two undetermined constants, which is consistent with 
the fact that we are solving a second-order differential equation. | 


REMARK 2.1.3 Again we see that the solving of a differential equation leads 
to a family of solutions. In the last example, that family is indexed by two 
parameters A and B. As we shall see below (especially Section 2.5), a typ- 
ical physical problem will give rise to two initial conditions that determine 
those parameters. The Picard Existence and Uniqueness Theorem gives the 
mathematical underpinning for these ideas. 


EXAMPLE 2.1.4 Solve the differential equation 
2y" + 6y’ + 2y =0. 
Solution: The associated polynomial equation is 
Qr? + 6r+2=0. 


This equation does not factor in any obvious way, so we use the quadratic 
formula: 


—64V67—4-2-2  -64V20  -642/5  -34 V5 


Tra —-—_RaeF C—O roo SO Ss 


2-2 7 4 4 2 
Thus the general solution to the differential equation is 


—3+V5 —3-VJ5 
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EXAMPLE 2.1.5 Solve the differential equation 
y” — 6y' + 9y =0. 
Solution: In this case the associated polynomial is 
r?—6r+9=0. 


This algebraic equation factors as (r — 3) - (r — 3) = 0 hence has the single 
solution r = 3. But our differential equation is second order, and therefore we 
seek two independent solutions. 

In the case that the associated polynomial has just one root, we find the 
other solution with an augmented guess: Our new guess is y = x - e®”. (See 
Section 2.4 for an explanation of where this guess comes from.) The reader 
may check for himself/herself that this new guess is also a solution. So the 
general solution of the differential equation is 


y= A+" + B-xe™*. a 


As a prologue to our next example, we must review some ideas connected 
with complex exponentials. Recall that 


8 


2 3 oO 
7 


es a ee ee ee 
| tee rr a =), 
J 


ma 


This equation persists if we replace the real variable x by a complex variable 


z. Thus 
2 3 


z | f= te ve 
Omak yt 


Now write z = x+7y, and let us gather together the real and imaginary parts 
of this last equation: 


e? _ erty 
e* -e 
- . (iy)? . (iy)? , (iy)? 
=e (1+iy4 aI =F 31 ++ 1 Tee 


2 4 3 5 
_ 2 yy i aes 
= e {(-E+ b+) ai(y- O48 + )} 


e” (cosy + isin v) ; 


Taking x = 0 we obtain the famous identity 


e'Y =cosy+isiny. 
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This equation—much used in mathematics, engineering, and physics—is 
known as Euler’s formula, in honor of Leonhard Euler (1707-1783). We shall 
also make considerable use of the more general formula 


erty — e (cos y+isin v) . 


In using complex numbers, the reader should of course remember that the 
square root of a negative number is an imaginary number. For instance, 


V—-4= £27 and V—25 = +51. 


EXAMPLE 2.1.6 Solve the differential equation 


Ay" + 4y’ + 2y=0. 
Solution: The associated polynomial is 
4r? + 4r+2=0. 


We apply the quadratic formula to solve it: 


4+ V42-—4-4.2 4+ /-16 4+4/ Lt 4+ 41 —l+i 


CS Fs FE 


2-4 7 8 8 8 2 


Thus the solutions to our differential equation are 
—1+i, na y=e or”, 
A general solution is given by 


—1+4i -1 


y=A-e 2? *“+B-e aa 


Using Euler’s formula, we may rewrite this general solution as 


y= A- et/2eit/2 4B. e—#/2p—ta/2 
= A-e~*/?\cos #/2 + isin 2/2] + Be~*/?[cos «/2 — isinx/2]. (2.1.6.1) 


We shall now use some propitious choices of A and B to extract meaningful 
real-valued solutions. First choose A = 1/2, B = 1/2. Putting these values in 
equation (2.1.6.1) gives 
—a/2 


y=e cos xz /2. 


Now taking A = —i/2, B =i/2 gives the solution 


y=e sine /2. 


As a result of this little trick, we may rewrite our general solution as 


y = E-e~*/? cosa/2+ F-e-*/*sina/2. 
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As usual, we invite the reader to plug this last solution into the differential 
equation to verify that it really works. a 


We conclude this section with a last example of a homogeneous, second- 
order, linear ordinary differential equation with constant coefficients, and with 
complex roots, just to show how straightforward the methodology really is. 


EXAMPLE 2.1.7 Solve the differential equation 
y —2y' + 5y=0. 
Solution: The associated polynomial is 
7? —2r+5=0. 
According to the quadratic formula, the solutions of this equation are 


24 (—2)2-—4-1-5 2441 1493 
TS =1lraZ. 
2 2 : 
Hence the roots of the associated polynomial are r = 1 + 27 and 1 — 2i. 
According to what we have learned, two independent solutions to the 
differential equation are thus given by 


y = e* cos2x and y=e*sin2z. 
Therefore the general solution is given by 
y = Ae” cos2x + Be* sin 2z. 


Please verify this solution for yourself. i] 


(I 


Exercises 


1. Find the general solution of each of the following differential equations. 


(a) y"+y'—6y=0 (j) y” — by’ + 25y = 0 
(b) y” +2y’+y=0 (k) 4y” + 20y’ + 25y = 0 
(c) y”+8y=0 (lI) y+ 2y'+3y=0 
(d) 2y” — 4y' + 8y =0 (m) y” = 4y 

(e) y” —4y'+4y =0 (n) 4y” — 8y'+ Ty =0 
(f) y” —9y' + 20y = 0 (0) 2y”+y'—y=0 


(g) 2y”+2y’+3y=0 (p) 16y” — 8y'+y=0 
(h) 4y” —12y’+9y =0 (q) y’ +4y’+5y =0 
(i) y’+y'=0 (r) y” +4y’—5y=0 


2.2. METHOD OF UNDETERMINED COEFFICIENTS 73 


2. Find the solution of each of the following initial value problems: 


(a) y” —5y'+6y =0, y(1) = e? and y'(1) = 3e? 

(b) y” —6y'+5y=0, y(0) = 3 and y/(0) = 11 

(c) y"” —6y'+ 9y =0, y(0) =0 and y'(0) =5 

(d) y+ 4y' + 5y =0, y(0) = 1 and y'(0) =0 

(e) y” +4y’ +2y =0, y(0) = —1 and y’(0) = 24+3V2 
(f) y”+8y' —9y =0, y(1) = 2 and y’(1) =0 


3. Show that the general solution of the equation 
y" + Py’ +Qy=0 
(where P and Q are constant) approaches 0 as x — +00 if and only if P 


and Q are both positive. 
4. Show that the derivative of any solution of 
y + Py’ +Qy=0 
(where P and Q are constant) is also a solution. 
5. The equation 
xy" + pay’ + qy =0, 
where p and q are constants, is known as Euler’s equidimensional equa- 
tion. Show that the change of variable « = e* transforms Euler’s equation 


into a new equation with constant coefficients. Apply this technique to 
find the general solution of each of the following equations. 


(a) ay” +3ay’+10y =0 (f) «?y” + 2xy' — by =0 
(b) 2x7y" + 10ry’ + 8y = 0 (g) xy” + 2xy’ + 3y =0 
(c) ay” +2ary’ — 12y =0 (h) 2?y” + ay’ —2y =0 

(d) 4x?y” —3y=0 (i) 2?y” + ay’ — 16y =0 


(e) a?y” — 3ay' + 4y =0 


6. Find the differential equation of each of the following general solution 


sets. 
(a) Ae” + Be~** (e) Ae®* + Be~* 
(b) A+ Be” (f) Ae~* + Be~** 
(c) Ae?* + Be*® (g) Ae?” + Be~?* 
(d) Ae” cos 3a + Be® sin 3x (h) Ae~** cosa + Be~* sin x 


a 


2.2 The Method of Undetermined Coefficients 


“Undetermined coefficients” is a method of organized guessing. We have al- 
ready seen guessing, in one form or another, serve us well in solving first-order 
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linear equations and also in solving homogeneous second-order linear equa- 
tions with constant coefficients. Now we shall expand the technique to cover 
inhomogeneous second-order linear equations. 

We must begin by discussing what the solution to such an equation will 
look like. Consider an equation of the form 


ay” + by’ + cy = f(a). (2.2.1) 


Suppose that we can find (by guessing or by some other means) a function 
y = yo(x) that satisfies this equation. We call yo a particular solution of 
the differential equation. Notice that it will not be the case that a constant 
multiple of yo will also solve the equation. In fact, if we consider y = A - yo 
and plug this function into the equation, then we obtain 


alAyo]” + b[Ayo]’ + c[Ayo] = Alayg + byg + cyo] = A- f. 


We see that, if A #1, then A- yo is not a solution. But we expect the solution 
of a second-order equation to have two free constants. Where will they come 
from? 

The answer is that we must separately solve the associated homogeneous 
equation, which is 

ay” + by’ +cy=0. 
If y; and ye are solutions of this equation then of course (as we learned in 
the last section) A-y; + B- ye will be a general solution of this homogeneous 
equation. But then the general solution of the original differential equation 
(2.2.1) will be 
Y= PAs yer Bye. 

We invite the reader to verify that, no matter what the choice of A and B, 
this y will be a solution of the original differential equation (2.2.1). 

These ideas are best hammered home by the examination of some exam- 
ples. 


EXAMPLE 2.2.2 Find the general solution of the differential equation 
y’ +y=sine. (2.2.2.1) 


Solution: We might guess that y = sin or y = cosa is a particular solution 
of this equation. But in fact these are solutions of the homogeneous equation 


y" +y=0 


(as we may check by using the techniques of the last section, or just by direct 
verification). So if we want to find a particular solution of (2.2.2.1) then we 
must try a bit harder. 

Inspired by our experience with the case of repeated roots for the second- 
order, homogeneous linear equation with constant coefficients (as in the last 
section), we instead will guess 


yo=a-xcosx+P-axsine 
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for our particular solution. Notice that we allow arbitrary constants in front 
of the functions «cos x and «sin xz. These are the “undetermined coefficients” 
that we seek. 

Now we simply plug the guess into the differential equation and see what 
happens. Thus 


[a-xcosx+-axsina|” +[la-xcosxrt+ 3-xsinaz] =sinz 


or 


a(2(—sin z)+2(— cos x))+8(2cos x+a(—sinxz))+[axcosx+Bxsin 2] = sinx 
or 
(—2a) sinx + (28) cosz+ (—8+ B)xsinx + (—a+a)xcosx = sinc. 
We see that there is considerable cancellation, and we end up with 
—2asinxz+26cosxz =sing. 
The only way that this can be an identity in x is if -2a = 1 and 26 = 0 or 


a= -—1/2 and 6 =0. 
Thus our particular solution is 


1 
Yo = — 5X COs x 
and our general solution is 
1 , 
y= —pecosz + Acose + Bsina. ai 


REMARK 2.2.3 As usual, for a second-order equation we expect, and find, 
that there are two unknown parameters that parametrize the set of solutions 
(or the general solution). Notice that these are not the same as the “undeter- 
mined coefficients” that we used to find our particular solution. 


EXAMPLE 2.2.4 Find the solution of 
y” —y — 2y = 42? 
that satisfies y(0) = 0 and y/(0) = 1. 
Solution: The associated homogeneous equation is 
y" —y' —2y =0 
and this has associated polynomial 


r?—r—2=0. 
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The roots are obviously r = 2,—1 and so the general solution of the homoge- 
neous equation is y = A- e?” + B-e-*. 

For a particular solution, our guess will be a polynomial. Guessing a 
second-degree polynomial makes good sense, since a guess of a higher-order 
polynomial is going to produce terms of high degree that we do not want. Thus 
we guess that yo(2) = ax? + Bx + y. Plugging this guess into the differential 
equation gives 


[ox? + Baty)" — lox? + Bx +]! — 2[ax? + Ba +4] = 42? 


or 
[2a] — [a - 2x + 6] — [2ax? + 262 + 2y] = 42”. 


Grouping like terms together gives 


2ax* + [—2a — 26)z + [2a — 8 — 24] = 42”. 


As a result, we find that 


-2a = 
—2a-26 = 
2a—B-2y = 0. 


This system is easily solved to yield a = —2, 0 = 2, y = —3. So our 
particular solution is yo(z) = —2a2? + 2x — 3. The general solution of the 
original differential equation is then 


y(x) = (—2a7 + 2a —3) + A-e** + Be. (2.2.4.1) 


Now we seek the solution that satisfies the initial conditions y(0) = 0 and 
y’(0) = 1. These translate to 


0 = y(0) = (-2-07+2-0-3)+A-e°+B-e 


and 


1=y'(0) = (-4-0+2-0)+2A-e°-B-e°. 
This gives the equations 


-3+A+B 
2+2A-B. 


rE oO 
II 


Of course we can solve this system quickly to find that A = 2/3, B = 7/3. 
In conclusion, the solution to our initial boundary value problem is 
2 7 


= 24 i Lee 2. pret 
y(a) = (—2x? + 2x ate ete a 


gabe 
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REMARK 2.2.5 Again notice that the undetermined coefficients a@ and 3 and 
that we used to guess the particular solution are not the same as the param- 
eters that gave the two degrees of freedom in our general solution (2.2.4.1). 
Further notice that we needed those two degrees of freedom so that we could 
meet the two initial conditions. 


a 


Exercises 


1. Find the general solution of each of the following equations. 


(a) y” +3y' — 10y = 6e*” 
(b) y” +4y = 3sinz 
(c) y” + 10y’ + 25y = 14e7°” 
(d) y” —2y' + 5y = 25a? + 12 
(e) y” —y' — 6y = 20e°** 
(f) y” — 3y' +2y = 14sin 2x — 18 cos 2x 
(g) y’ +y=2cosz 
(h) y” — 2y’ = 122 — 10 
(i) y” — 2y' +y = 6e* 
(ji) y” — 2y’ +2y =e* sing 
(k) y” +y' = 1024 +2 
2. Find the solution of the differential equation that satisfies the given initial 
conditions. 
(a) y”—3y'+y=2, y(0) = 1, y'(0) =0 
uw / if 
(b) y +4y'+6y=cosz, — (0) = 0, y'{0) =2 
c =sinz, 115 1)=1 
y ty: : . y( ; a y'( ) ey 
(d) y —3y +2y=0, y(-1) = 0, Eo ls 
(ec) yy ty=e, y(0) =1, y (0) =0 
(f) y+ 2y'+y=1, y(1) =1, y'(1) =0 
3. If k and b are positive constants, then find the general solution of 
y+ key = sin bz. 
4. If yi and y2 are solutions of 


and 


y” + P(x)y' + Q(«)y = Ri(z) 


We 


y 4 


P(x)y' + Q(a)y = Ra(2), 


respectively, then show that y = yi + y2 is a solution of 


y” + P(a)y’ + Q(a)y = Ri(a) + Ra(2). 


78 CHAPTER 2: SECOND-ORDER LINEAR EQUATIONS 


This is called the principle of superposition. Use this idea to find the 
general solution of 
(a) y” +4y = 4cos 2x + 6cos x + 8x? 
(b) y” + 9y = 2sin 3x + 4sin x — 26e7?” 
5. Use your symbol manipulation software, such as Maple or Mathematica, 


to write a routine for solving for the undetermined coefficients in the 
solution of an ordinary differential equation, once an initial guess is given. 


a 


2.3. The Method of Variation of Parameters 


Variation of parameters is a method for producing a particular solution to 
an inhomogeneous equation by exploiting the (usually much simpler to find) 
solutions to the associated homogeneous equation. 

Let us consider the differential equation 


y” + p(a)y' + q(x)y = r(a2). (2.3.1) 


Assume that, by some method or other, we have found the general solution 
of the associated homogeneous equation 


y" + p(x)y' + a(x)y = 0 
to be 

y = Ayi (x) + Bya(z) . 
What we do now is to guess that a particular solution to the original equation 
(2.3.1) is 

yo(x) = vi(x) -y1(x) + v2(x) - yo(a) (2.3.2) 
for some choice of functions v1, v2. 
Now let us analyze this guess. We calculate that 


Yo = [iyi + ory] + [vgye + vay] = [viyr + vay] + [vry, + vays]. (2.3.3) 


We also need to calculate the second derivative of yo. But we do not want 
the extra complication of having second derivatives of v; and v2. So we shall 
mandate that the first expression in brackets on the far right-hand side of 
(2.3.3) is identically zero. Thus we have 


vpy1 + Vgy2 = 0. (2.3.4) 


Hence 
Yo = U1, + V2Y4 (2:3.5) 
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and we can now calculate that 
yo = [vyy + ory] + [veye + veys] - (2.3.6) 


Now let us substitute (2.3.2), (2.3.5), and (2.3.6) into the differential equa- 
tion. The result is 


(ivi tory |+[vsyot vauS]) robe) ( agl-beavs ) tale) (vise) = r(z). 


After some algebraic manipulation, this becomes 
i / i i. he | er (ees 
V1 (ui + py + aon) + v2 @ + PY + an) TOY Paya = « 


Since y1,y2 are solutions of the homogeneous equation, the expressions in 
parentheses vanish. The result is 


ViY, + U2¥o =P. O27) 


At long last we have two equations to solve in order to determine what v1 
and v2 must be. Namely, we focus on equations (2.3.4) and (2.3.7) to obtain 


/ / 
ViYy1 + Voy2 = 0, 


ror ye 63 = 
UY, 1 V2Yo =T- 


In practice, these can be solved for v},v4, and then integration tells us what 
V1, V2 must be. 

As usual, the best way to understand a new technique is by way of some 
examples. 


EXAMPLE 2.3.8 Find the general solution of 
y +y=csee. 


Solution: Of course the general solution to the associated homogeneous equa- 
tion is familiar. It is 
y(a) = Asinx+ Beosz. 


We of course think of y:(x) = sina and yo(a) = cos. In order to find a 
particular solution, we need to solve the equations 


vysinz+vu;coszx = 0 


vy (cosx) + v5(—sinz) = csc. 
This is a simple algebra problem, and we find that 


u(x) =cotz and v(x) =-1. 
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As a result, 
v1 (x) = In|sin a| and v9(“) = —a. 


(As the reader will see, we do not need any constants of integration.) 
The final result is then that a particular solution of our differential equa- 
tion is 


Yo(x) = v1 (x)y1 (x) + v2(x)y2(x) = (In| sin |) - sina + (—x) - cosa. 


We invite the reader to check that this solution actually works. The general 
solution of the original differential equation is thus 


y(x) = {(n |sinal) ‘sing + (—2) cost} + Asinz+ Bcosz. a 


REMARK 2.3.9 The method of variation of parameters has the advantage— 
over the method of undetermined coefficients—of not involving any guessing. 
It is a direct method that always leads to a solution. However, the integrals 
that we may need to perform to carry the technique to completion may, for 
some problems, be rather difficult. 


EXAMPLE 2.3.10 Solve the differential equation 
y” = y’ = 2y = Aa? 
using the method of variation of parameters. 


Solution: The reader will note that, in the last section (Example 2.2.4), we 
solved this same equation using the method of undetermined coefficients (or 
organized guessing). Now we shall solve it a second time by our new method. 
As we saw before, the homogeneous equation has the general solution 
y = Ae*™* + Be*, 
Thus yi(x) = e?* and yo(x) =e”. 
Hence we solve the system 


/ 42 fe 
vje" +use " =0, 


v; [2e?*] + vy[-e7*] = 42”. 
The result is 


4 4 
Oo zee and = u,(z) = — gre 


We may use integration by parts to then determine that 
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and 
Ay? Se, Bix 
vo(a“) = “a8 ae ae 
We finally see that a particular solution to our differential equation is 
yo(z) = v(x) - r(x) + v2(x)y2(z) 
_ (Ape = ae _ so) e2t 
Aas Bi Be 
( as — se" e 
207 Qe 1\ de? 8x8 
- (-$-F-3)+($+F 3) 
= -2¢7+22-3. 


In conclusion, the general solution of the original differential equation is 
y(x2) = {-20" + 2a — sf + Ae** + Be. 


As you can see, this is the same answer that we obtained in Section 2.2, Ex- 
ample 2.2.4, by the method of undetermined coefficients. (] 


REMARK 2.3.11 Of course the method of variation of parameters is a tech- 
nique for finding a particular solution of a differential equation. The general 
solution of the associated homogeneous equation must be found by a different 
technique. 


i 
Exercises 
1. Find a particular solution of each of the following differential equations. 


(a) y” +4y = tan 2x (d) yy” +2y! + 5y =e7* sec 2x 
(b) y" ats 2y’+y=e” Inz (e) Qy” + 3y’ +y=e°%* 
(c) y" —2y'—3y=64ae"" (f) yy” — 8y' + 2y= (1 +e°7)* 


2. Find a particular solution of each of the following differential equations. 


(a) y" +y=seca (e) y’+y=tane 

(b) y”+y=cot?x (f) y’ +y=secrtange 
(ec) y” +y = cot 2x (g) y’ +y=secucscr 
(d) y” +y=xcoszx 
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3. Find a particular solution of 
y" —2y'+y = 2a, 


first by inspection and then by variation of parameters. 

4. Find a particular solution of 
y—y —ty=e", 

first by undetermined coefficients and then by variation of parameters. 
5. Find the general solution of each of the following equations. 

(a) (2? — Ly” — ay’ + 2y = (2? — 1)? 

(b) (2? + a)y" + (2-27)y’ — (24+ 2)y =2(e +1) 

(c) (1-a)y” +ay!—y= (1-2)? 

(d) zy” —(1+a)y'+y=27e* 

(e) x?y — 2ay’ + 2y = xe" 


6. Use your symbol manipulation software, such as Maple or Mathematica, 
to find a particular solution to an ordinary differential equation, once the 
solutions to the homogeneous equation are given. 


a 


2.4 The Use of a Known Solution to Find Another 


Consider a general, second-order, linear, homogeneous equation of the form 


y’ +p(x)y’ +q(x)y =0. (2.4.1) 


It often happens—and we have seen this in our earlier work—that one can 
either guess or elicit one solution to the equation. But finding the second 
independent solution is more difficult. In this section we introduce a method 
for finding that second solution. 

In fact we exploit a notational trick that served us well in Section 2.3 on 
variation of parameters. Namely, we shall assume that we have found the one 
solution y; and we shall suppose that the second solution we seek is y2 = v-y1 
for some undetermined function v. Our job, then, is to find v. 

Assuming, then, that y; is a solution of (2.4.1), we shall substitute yo = 
v- yy, into (2.4.1) and see what this tells us about calculating v. We see that 


[v- yi)” + p(x) - [v- yi)’ + a(x) - [v- yi] =0 
or 
[vy +20’ yy tu yt] + plz): [oy tu: yi) +a(z)-[v- yi] =0. 
We rearrange this identity to find that 


v- [yy + p(t) - 1 + a(x)yi] + [v" > yi] + [v’ - (2y, + p(x) -y1)] =0- 
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Now we are assuming that y, is a solution of the differential equation (2.4.1), 
so the first expression in brackets must vanish. As a result, 


[v" yi] + [v'- Qy +p-yn)] =0. 


In the spirit of separation of variables, we may rewrite this equation as 


py" Yi 
— = —924 — gy, 


v! Y1 


Integrating once, we find that 


Inv’ = -2Iny, — [ro dx 


or 
a de Soles 
YI 


Applying the integral one last time yields 
/ 1 — ff p(x) dx 
v= | ze dz. 
YY 
What does this mean? It tells us that a second solution to our differential 


equation, given that y; is one solution, is given by 


1 

yo(x) = v(x) - yi (x) = / we ae as| -yi(x). (2.4.2) 
1 

In order to really understand what this means, let us apply the method to 
some particular differential equations. 


EXAMPLE 2.4.3 Find the general solution of the differential equation 
y” —6y' + 9y = 0. 


Solution: When we first encountered this type of equation in Section 2.1, we 
learned to study the associated polynomial 


r?—6r+9=0. 


Unfortunately, the polynomial has only the repeated root r = 3, so we find 
just the one solution y;(2) = e?*. Where do we find another? 

In Section 2.1, we found the second solution by guessing. Now we have a 
more systematic way of finding that second solution, and we use it to test out 
the new methodology. Observe that p(#) = —6 and q(a) = 9. According to 
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formula (2.4.1), we can find a second solution yg = v- y, with 


v= [ae fre ae 
YW 


uf — {| -6dzx 
— |r ‘A 6 dx 
ie : eo dx 
= fracas. 


Thus the second solution to our differential equation is yz = v-y, = x-e 
and the general solution is therefore 


3a 


y= A-e** + B-xe**. 


This reaffirms what we learned in Section 2.1 by a different and more elemen- 
tary technique. | 


Next we turn to an example of a nonconstant coefficient equation. 


EXAMPLE 2.4.4 Find the general solution of the differential equation 


xy” + ay’—y=0. 


Solution: Differentiating a monomial once lowers the degree by 1 and differ- 
entiating it twice lowers the degree by 2. So it is natural to guess that this 
differential equation has a power of x as a solution. And y;(x) = x works. 

We use formula (2.4.2) to find a second solution of the form y2 = v- y1. 
First we rewrite the equation in the standard form as 


1 1 
yf y= = 0 
r x 
and we note then that p(x) = 1/z and q(x) = —1/a?. Thus 


ih Se Lvl) a dy 
YW 


1 
fache* dx 
x 


v(x) 


I 


I 
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In conclusion, yg = v- y, = [—1/(2x?)] - 2 = —1/(2x) and the general 
solution is 


ye) =A-24B-(—Z). : 


a 


Exercises 


1. Use the method of this section to find y2 and the general solution of each 
of the following equations from the given solution y,. 


(a) y"+y=0 , w(x) =sine 


(b) y"—-y=0 , yl(a)=e* 
2. The equation ry” +3y’ = 0 has the obvious solution yi = 1. Find y2 and 
find the general solution. 
3. Verify that y; = x? is one solution of x?y” + xy’ — 4y = 0, and then find 
y2 and the general solution. 
4. The equation 
(1—2")y” — 2ay' + 2y = 0 (*) 


is a special case, corresponding to p = 1, of the Legendre equation 


(1—2*)y” — 2xy' + p(pt l)y =0. 


Equation (*) has yi = x as a solution. Find the general solution. 


5. The equation 


xy” ry’ G i) y —0 (x) 


is a special case, corresponding to p = 1/2, of the Bessel equation 


ay” + ay’ + (a? —p?)y =0. 
Verify that yi(2) = «~\/? sin x is a solution of (x) for « > 0 and find the 
general solution. 


6. For each of the following equations, yi(”) = x is one solution. In each 
case, find the general solution. 
" x , 1 _ 
(a) y - sy +o =9 
(b) a?y” + 2ay' — 2y =0 
(c) ay” — a(x + 2)y' + (a +2)y =0 
7. Find the general solution of the differential equation 


/ 


y” — ay’ +ay=0. 
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8. Verify that one solution of the equation 


ay” — (2a + 1)y'+ («@+1)y =0 


is y1(x) = e®. Find the general solution. 
9. If y: is a nonzero solution of the differential equation 


y + P(z)y' + Q(x)y = 0 


and if ye =v-y1, with 
1 - fpde 
v(x) = | =-e Spade ge 
YW 
then show that yi and ye are linearly independent (that is, yi is not a 
multiple of y2). 


TT 


2.5 Vibrations and Oscillations 


When a physical system in stable equilibrium is disturbed, then it is subject to 
forces that tend to restore the equilibrium. The result can lead to oscillations 
or vibrations. It is described by an ordinary differential equation of the form 


an (t) dx 
dee OM ae 


In this section we shall learn how and why such an equation models the phys- 
ical system we have described, and we shall see how its solution sheds light 
on the physics of the situation. 


+ q(t)x =r(t). 


2.5.1 Undamped Simple Harmonic Motion 


Our basic example will be a cart of mass M attached to a nearby wall by 
means of a spring. See Figure 2.1. 

The spring exerts no force when the cart is at its rest position « = 0 
(notice that, contrary to custom, we are locating the origin to the right of the 
wall). According to Hooke’s law, if the cart is displaced a distance x, then the 
spring exerts a proportional force F, = —kx, where k is a positive constant 
known as Hooke’s constant. Observe that, if « > 0, then the cart is moved to 
the right and the spring pulls to the left; so the force is negative. Obversely, 
if « < 0 then the cart is moved to the left and the spring resists with a force 
to the right; so the force is positive. 

Newton’s second law of motion says that the mass of the cart times its 
acceleration equals the force acting on the cart. Thus 
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FIGURE 2.1 
Hooke’s law. 


As a result, 
ax a k 
dt? M 
It is useful to let a = \/k/M (both k and M are positive) and thus to write 
the equation as 


c=0. 


L.'s 
We +a°rn = 0. 

Of course this is a familiar differential equation for us, and we can write 
its general solution immediately: 


x(t) = Asinat + Bcosat. 


Now suppose that the cart is pulled to the right to an initial position of 
x = xp > 0 and then is simply released (with initial velocity 0). Then we have 
the initial conditions 


dx 
— d — = 
x(0) = 2 an i (0) =0 
Thus 
xo =  Asin(a-0)+ Bcos(a- 0) 
0 = Aacos(a-0)— Basin(a- 0) 
or 


ri = B 
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We conclude that B = x9, A = 0, and we find the solution of the system 
to be 
x(t) = xp cosat. 


In other words, if the cart is displaced a distance xo and released, then the 
result is a simple harmonic motion (described by the cosine function) with 
amplitude xo (i.e., the cart glides back and forth, xo units to the left of the 
origin and then xo units to the right), and with period T = 27/a (which 
means that the motion repeats itself every 27/a units of time). 

The frequency f of the motion is the number of cycles per unit of time, 
hence f -T = 1, or f = 1/T = a/(2z). It is useful to substitute back in the 
actual value of a so that we can analyze the physics of the system. Thus 


amplitude = xo 


Ir /M 
Vk 
Vk 


period = T = 


frequency = f = , 
a aa 
We see that, if the stiffness & of the spring is increased, then the period 
becomes smaller and the frequency increases. Likewise, if the mass M of the 
cart is increased then the period increases and the frequency decreases. 


2.5.2 Damped Vibrations 


It probably has occurred to the reader that the physical model in the last 
subsection is not realistic. Typically, a cart that is attached to a spring and 
released, just as we have described, will enter a harmonic motion that dies out 
over time. In other words, resistance and friction will cause the system to be 
damped. Let us add that information to the system. 

Physical considerations make it plausible to postulate that the resistance 
is proportional to the velocity of the moving cart. Thus 


where Fy denotes damping force and c > 0 is a positive constant that measures 
the resistance of the medium (air or water or oil, etc.). Notice, therefore, that 
when the cart is traveling to the right then dx/dt > 0 and therefore the force 
of resistance is negative (i.e., in the other direction). Likewise, when the cart 
is traveling to the left then dx/dt <0 and the force of resistance is positive. 

Since the total of all the forces acting on the cart equals the mass times 
the acceleration, we now have 

ax 


i een 2 
dt? Tad 
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In other words, 

dx a dx = k; =, 

d2'M dM” 
Because of convenience and tradition, we again take a = ,/k/M and we set 
b = c/(2M). Thus the differential equation takes the form 


This is a second-order, linear, homogeneous ordinary differential equation 
with constant coefficients. The associated polynomial is 


r? + 2br +a? =0, 


and it has roots 


—2b+ V4b? — 4a? 


5 b2 —a?. 


=-b4 


TM1,72 = 


Now we must consider three cases. 


CASE A. c? —4kM > 0: In other words, b? — a? > 0. We are assuming 
that the frictional force (which depends on c) is significantly larger than the 
stiffness of the spring (which depends on k). Thus we would expect the system 
to damp heavily. In any event, the calculation of r,,r2 involves the square 
root of a positive real number which is smaller than b, so that the values 
of —b + Vb? — a? are definitely negative. Thus r),r2 are distinct real (and 
negative) roots of the associated polynomial equation. 
Thus the general solution of our system in this case is 


= ryt rat 
x = Ae™" + Be™’, 


where (we repeat) 71,72 are negative real numbers. We apply the initial con- 
ditions (0) = ao, da/dt(0) = 0, just as in the last section (details are left to 
the reader). The result is the particular solution 


x(t) = = (me - ne) ‘ (2.5.1) 


T1 — 12 


Notice that, in this heavily damped system, no oscillation occurs (i.e., 
there are no sines or cosines in the expression for x(t)). The system simply 
dies out. Figure 2.2 exhibits the graph of the function in (2.5.1). 


CASE B. c? — 4kM = 0: In other words, b? — a? = 0. This is the critical 
case, where the resistance balances the force of the spring. We see that b = a 
(both are known to be positive) and ry = r2 = —b = —a. We know, then, that 
the general solution to our differential equation is 


a(t) = Ae~* + Bte™™. 
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FIGURE 2.2 
The motion dies out. 


When the standard initial conditions are imposed, we find the particular so- 
lution 
a(t) =2p)-e “(1+ at). 


We see that this differs from the situation in CASE A by the factor (1 + at). 
That factor of course attenuates the damping, but there is still no oscillatory 
motion. We call this the critical case. The graph of our new x(t) is quite 
similar to the graph already shown in Figure 2.2. 

If there is any small decrease in the viscosity, however slight, then the 
system will begin to vibrate (as one would expect). That is the next, and last, 
case that we examine. 


CASE C. c? —4kM <0: This says that b? — a? < 0. Now 0 <b < aand 
the calculation of 1,72 entails taking the square root of a negative number. 
Thus r1,7r2 are the conjugate complex numbers —b + iva? — b?. We set a = 
Va? —b? >0. 


Now the general solution of our system, as we well know, is 


a(t) = e—* (4 sinat + Bcos at) 


If we evaluate A, B according to our usual initial conditions, then we find the 
particular solution 


a(t) = ee (0 sin at + acos at) 
ray 
ONO oi ( u sin at + ————= cos t) 
= ———e ————- sina at). 
a Vaz + b2 Va? + 6? 
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FIGURE 2.3 
A damped vibration. 


It is traditional and convenient to set 6 = arctan(b/a). It follows that 


b 
———_ = sind 
Vere 
and e 
————_. = cos. 
a2 +4 b2 
With this notation, we can express the last equation in the form 
fet do Be 
x(t) = RE (sin # sin at + cos @ cos at) 
a 
D1 pe 
ee cos(at — 0). (2.5.2) 
ray 


As you can see, there is oscillation because of the presence of the cosine 
function. The amplitude (the expression that appears in front of cosine) clearly 
falls off—rather rapidly—with t because of the presence of the exponential. 
The graph of this function is exhibited in Figure 2.3. 

Of course this function is not periodic—it is dying off, and not repeating 
itself. What is true, however, is that the graph crosses the t-axis (the equilib- 
rium position « = 0) at regular intervals. If we consider this interval T (which 
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is not a “period,” strictly speaking) as the time required for one complete 
cycle, then aT’ = 27 so 


2 D 
ee (2.5.3) 


a \/k/M — c2/(4M?) | 


We define the number f, which plays the role of “frequency” with respect 
to the indicated time interval, to be 


1 1 k c? 


~ T IVM 4M?" 

This number is commonly called the natural frequency of the system. When 
the viscosity vanishes, then our solution clearly reduces to the one we found 
earlier when there was no viscosity present. We also see that the frequency of 
the vibration is reduced by the presence of damping; increasing the viscosity 
further reduces the frequency. 


2.5.3 Forced Vibrations 


The vibrations that we have considered so far are called free vibrations because 
all the forces acting on the system are internal to the system itself. We now 
consider the situation in which there is an external force F. = f(t) acting on 
the system. This force could be an external magnetic field (acting on the steel 
cart) or vibration of the wall, or perhaps a stiff wind blowing. Again setting 
mass times acceleration equal to the resultant of all the forces acting on the 
system, we have 
dx 
M+ a = Fst fat fe, 

where F, = f(t) is the external force. 

Taking into account the definitions of the various forces, we may write 
the differential equation as 


Pr dz 
—~ +c—+ka = fit). 
de | dt 
So we see that the equation describing the physical system is second-order 
linear, and that the external force gives rise to an inhomogeneous term on the 
right. An interesting special case occurs when f(t) = Fo-coswt, in other words 
when that external force is periodic. Thus our equation becomes 


ce dz 

Te + ars + kx = Fo-coswt. (2.5.4) 
If we can find a particular solution of this equation, then we can combine 
it with the information about the solution of the associated homogeneous 
equation in the last subsection and then come up with the general solution of 
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the differential equation. We shall use the method of undetermined coefficients. 
Considering the form of the right-hand side, our guess will be 


x(t) = Asinwt + Bcoswt. 


Substituting this guess into the differential equation gives 


2 
vem 


d 
7) [Asinwt + Bcoswt] + ca [Asinwt + Bcoswt| 


+k[Asinwt + Bcoswt|] = Fo -coswt. 
With a little calculus and a little algebra we are led to the algebraic equations 


weA+(k—w*M)B = Fo 
(k—-w*M)A—weB = 0. 


We solve for A and B to obtain 


wceFo aa B= (k — w?M) Fo 


A= —— pee ikea SSE See 
(k — w?M)? + we? (k — w?M)? + w2c? 


Thus we have found the particular solution 


Fi 
xo(t) = oe MTS (wesin wt + (k —w?M) cos wt) : 
Calculating as above, we may write this in a more useful form with the 
notation ¢ = arctan[we/(k — w?M)]. Thus 
Fo 
£o(t) = ———— - cos(wt — ). 2.5.5 
oft) (k — w2M)? 4+ wc? ( ®) ( ) 


If we assume that we are dealing with the underdamped system, which 
is CASE C of the last subsection, we find that the general solution of our 
differential equation with a periodic external forcing term is 


a(t) = e % (4 cosat + Bsin at) 


Fo 
se (Ere EER -cos(wt — ¢). 


We see that, as long as some damping is present in the system (that is, b 
is nonzero and positive), then the first term in the definition of x(t) is clearly 
transient (i.e., it dies as t > oo because of the exponential term). Thus, as 
time goes on, the motion assumes the character of the second term in 2(t), 
which is the steady-state term. So we can say that, for large t, the physical 


nature of the general solution to our system is more or less like that of the 
particular solution x(t) that we found. The frequency of this forced vibration 
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equals the impressed frequency (originating with the external forcing term) 
w/2m. The amplitude is the coefficient 


Fo 
J(k — w2M)? + wc? © 


This expression for the amplitude depends on all the relevant physical 
constants, and it is enlightening to analyze it a bit. Observe, for instance, that 
if the viscosity c is very small and if w is close to \/k/M (so that k — w?M is 
very small) then the motion is lightly damped and the external (impressed) 
frequency w/27 is close to the natural frequency 


(2.5.6) 


Then the amplitude is very large (because we are dividing by a number close 
to 0). This phenomenon is known as resonance. There are classical examples 
of resonance.” For instance, several years ago there was a celebration of the 
anniversary of the Golden Gate Bridge (built in 1937), and many thousands 
of people marched in unison across the bridge. The frequency of their footfalls 
was so close to the natural frequency of the bridge (thought of as a suspended 
string under tension) that the bridge nearly fell apart. A famous incident at 
the Tacoma Narrows Bridge has been attributed to resonance, although more 
recent studies suggest a more complicated combination of effects (see the 
movie of this disaster at http://www. ketchum.org/bridgecollapse. html). 


2.5.4 A Few Remarks about Electricity 


It is known that if a periodic electromotive force, F = Ep, acts in a simple 
circuit containing a resistor, an inductor, and a capacitor, then the charge Q 
on the capacitor is governed by the differential equation 
d? d 
1@@ , pl@ 


+R 


1 
les ayy © ie NR 9 ee t. 
qe dt + ae 0 COS W 


This equation is of course quite similar to equation (2.5.4) for the oscillat- 
ing cart with external force. In particular, the following correspondences (or 
analogies) are suggested: 


mass Mf «<— inductance L; 
viscosity c «<— resistance R; 
stiffness of spring k <—- reciprocal of capacitance G ; 
displacement x <— charge Q on capacitor. 


?One of the basic ideas behind filter design is resonance. 
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The analogy between the mechanical and electrical systems renders iden- 
tical the mathematical analysis of the two systems, and enables us to carry 
over at once all mathematical conclusions from the first to the second. In the 
given electric circuit we therefore have a critical resistance below which the 
free behavior of the circuit will be vibratory with a certain natural frequency, 
a forced steady-state vibration of the charge Q, and resonance phenomena 
that appear when the circumstances are favorable. 


Math Nugget 


Charles Proteus Steinmetz (1865-1923) was a mathemati- 
cian, inventor, and electrical engineer. He pioneered the use 
of complex numbers in the study of electrical circuits. Af- 
ter he left Germany (on account of his socialist political 
activities) and emigrated to America, he was employed by 
the General Electric Company. He soon solved some of GE’s 
biggest problems—to design a method to mass-produce elec- 
tric motors, and to find a way to transmit electricity more 
than 3 miles. With these contributions alone Steinmetz had 
a massive impact on mankind. 

Steinmetz was a dwarf, crippled by a congenital defor- 
mity. He lived in pain, but was well liked for his humanity 
and his sense of humor, and certainly admired for his scien- 
tific prowess. The following Steinmetz story comes from the 
Letters section of Life Magazine (May 14, 1965): 


96 
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Sirs: In your article on Steinmetz (April 23) you mentioned 
a consultation with Henry Ford. My father, Burt Scott, who 
was an employee of Henry Ford for many years, related to 
me the story behind the meeting. Technical troubles devel- 
oped with a huge new generator at Ford’s River Rouge plant. 
His electrical engineers were unable to locate the difficulty 
so Ford solicited the aid of Steinmetz. When “the little gi- 
ant” arrived at the plant, he rejected all assistance, asking 
only for a notebook, pencil and cot. For two straight days 
and nights he listened to the generator and made count- 
less computations. Then he asked for a ladder, a measur- 
ing tape, and a piece of chalk. He laboriously ascended the 
ladder, made careful measurements, and put a chalk mark 
on the side of the generator. He descended and told his 
skeptical audience to remove a plate from the side of the 
generator [at the marked spot] and take out 16 windings 
from the field coil at that location. The corrections were 
made and the generator then functioned perfectly. Subse- 
quently Ford received a bill for $10,000 signed by Steinmetz 
for G.E. Ford returned the bill acknowledging the good job 
done by Steinmetz but respectfully requesting an itemized 
statement. Steinmetz replied as follows: Making chalk mark 
on generator $1. Knowing where to make mark $9,999. Total 
due $10,000. 


Exercises 


1. 


Consider the forced vibration in the underdamped case, and find the 
impressed frequency for which the amplitude attains a maximum. Will 
such an impressed frequency necessarily exist? This value of the impressed 
frequency, when it exists, is called the resonance frequency. Show that the 
resonance frequency is always less than the natural frequency. 

Consider the underdamped free vibration described by formula (2.5.2). 
Show that x assumes maximum values for t = 0,7,2T,..., where T is the 
“period,” as given in formula (2.5.3). If 21 and x2 are any two successive 
maxima, then show that x1/r2 = e°’. The logarithm of this quantity, or 
bT, is known as the logarithmic decrement of the vibration. 

A spherical buoy of radius r floats half-submerged in water. If it is de- 
pressed slightly, then a restoring force equal to the weight of the displaced 
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water presses it upward; and if it is then released, it will bob up and down. 
Find the period of oscillation if the friction of the water is negligible. 

4. A cylindrical buoy 2 feet in diameter floats with its axis vertical in fresh 
water of density 62.4 lb./ft.2 When depressed slightly and released, its 
period of oscillation is observed to be 1.9 seconds. What is the weight of 
the buoy? 

5. Suppose that a straight tunnel is drilled through the Earth between two 
points on its surface. The tunnel passes through the center of the earth. 
If tracks are laid, then—neglecting friction—a train placed in the tunnel 
at one end will roll through the Earth under its own weight, stop at the 
other end, and return. Show that the time required for a single, complete 
round trip is the same for all such tunnels (no matter what the beginning 
and ending points), and estimate its value. If the tunnel is 2 miles long, 
then what is the greatest speed attained by the train on its journey? 

6. The cart in Figure 2.1 weighs 128 pounds and is attached to the wall by 
a spring with spring constant k = 641b./ft. The cart is pulled 6 inches 
in the direction away from the wall and released with no initial velocity. 
Simultaneously, a periodic external force Fe = f(t) = 32sin4t lb is 
applied to the cart. Assuming that there is no air resistance, find the 
position « = x(t) of the cart at time ¢. Note particularly that |x(t)| 
assumes arbitrarily large values as t — +oo. This phenomenon is known 
as pure resonance and is caused by the fact that the forcing function has 
the same period as the free vibrations of the unforced system. 

7. Use your symbol manipulation software, such as Maple or Mathematica, to 
solve the ordinary differential equation with the given damping term and 
forcing term. In each instance you should assume that both the damping 
and the forcing terms occur on the right-hand side of the differential 
equation and that t > 0. 


(a) damping = —e’dx/dt, f =sint + cos 2t 
(b) damping = —Intdx/dt, f =e! 
(c) damping = —[e’]-Intdx/dt, f = cos 2t 
(d) damping = —t°dx/dt, f =e7' 


Dr 
2.6 Newton’s Law of Gravitation and Kepler’s Laws 


Newton’s law of universal gravitation is one of the great ideas of modern 
physics. It underlies so many important physical phenomena that it is part of 
the bedrock of science. In this section we show how Kepler’s laws of planetary 
motion can be derived from Newton’s gravitation law. It might be noted that 
Johannes Kepler himself (1571-1630) used thousands of astronomical obser- 
vations (made by his teacher Tycho Brahe (1546-1601)) in order to formulate 
his laws. Kepler was a follower of Copernicus, who postulated that the planets 
orbited about the sun, but Brahe held the more traditional Ptolemaic view 
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that the Earth was the center of the orbits. Brahe did not want to let Kepler 
use his data, for he feared that Kepler would use the data to promote the 
Copernican theory. As luck would have it, Brahe died from a burst bladder 
after a night of excessive beer drinking at a social function. So Kepler was 
able to get the valuable numbers from Tycho Brahe’s family. 


KEPLER’S LAWS OF PLANETARY 
MOTION 


I. The orbit of each planet is an ellipse with the sun at one 
focus (Figure 2.4). 


II. The segment from the center of the sun to the center 
of an orbiting planet sweeps out area at a constant rate 
(Figure 2.5). 


III. The square of the period of revolution of a planet is 
proportional to the cube of the length of the major axis of 
its elliptical orbit, with the same constant of proportionality 
for any planet. (Figure 2.6). 


Interestingly, Copernicus believed that the orbits were circles (rather than 
ellipses, as we now know them to be). Newton determined how to derive 
the laws of motion analytically, and he was able to prove that the orbits 
must be ellipses (although it should be noted that the ellipses are very nearly 
circular—their eccentricity is very close to 0). Furthermore, the eccentricity 
of an elliptical orbit has an important physical interpretation. The present 
section explores all these ideas. 

It turns out that the eccentricities of the ellipses that arise in the orbits 
of the planets are very small, so that the orbits are nearly circles, but they 
are definitely not circles. That is the importance of Kepler’s first law. 

The second law tells us that, when the planet is at its apogee (furthest 
from the sun), then it is traveling relatively slowly, whereas at its perigee 
(nearest point to the sun), it is traveling relatively rapidly—Figure 2.7. In fact 
the second law is valid for any central force, and Newton knew this important 
fact. 

The third law allows us to calculate the length of a year on any given 
planet from knowledge of the shape of its orbit. 

In this section we shall learn how to derive Kepler’s three laws from New- 
ton’s inverse square law of gravitational attraction. To keep matters as simple 


2.6. KEPLER’S LAWS 99 


Earth 


FIGURE 2.4 
The elliptical orbit of the Earth about the sun. 


Earth 


FIGURE 2.5 
Area is swept out at a constant rate. 
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major axis 


FIGURE 2.6 
The square of the period is proportional to the cube of the major axis. 


Earth 


[fastest] 


FIGURE 2.7 
Apogee motion vs. perigee motion. 
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Earth 


FIGURE 2.8 
Polar coordinate system for the motion of the Earth about the sun. 


as possible, we shall assume that our solar system contains a fixed sun and just 
one planet (the Earth, for instance). The problem of analyzing the gravitation 
influence of three or more planets on each other is incredibly complicated and 
is still not thoroughly understood. 

The argument that we present is due to S. Kochen, and is used with his 
permission. 


2.6.1 Kepler’s Second Law 


It is convenient to derive the second law first. We use a polar coordinate 
system with the origin at the center of the sun. We analyze a single planet 
which orbits the sun, and we denote the position of that planet at time t by 
the vector R(t). The only physical facts that we shall use in this portion of 
the argument are Newton’s second law and the self-evident assertion that the 
gravitational force exerted by the sun on a planet is a vector parallel to R(t). 
See Figure 2.8. 

If F is force, m is the mass of the planet (Earth), and a is its acceleration 
then Newton’s second law says that 


F = ma=mR'(t). 


We conclude that R(t) is parallel to R(t) for every value of t. 
Now 


< (R(t) x R'(t)) = [R'(t) x R'(t)] + [R(t) x R”(d)]. 
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FIGURE 2.9 
The increment of area. 


Note that the first of these terms on the right is zero because the cross product 
of any vector with itself is zero. The second is zero because R(t) is parallel to 
R(t) for every t. We conclude that 


< (R(t) x Ri(t)) =0 
hence 
R(t) x R'(t) =C, (2.6.1) 


where C is a constant vector. Notice that this already guarantees that R(t) 
and R’(t) always lie in the same plane, hence that the orbit takes place in a 
plane. 

Now let At be an increment of time, AR the corresponding increment of 
position, and AA the increment of area swept out. Look at Figure 2.9. 

We see that AA is approximately equal to half the area of the parallelo- 
gram determined by the vectors R and AR. The area of this parallelogram is 
|| x AR||. Thus 


Bf RSE eget 
At 2 At At || 


Letting At — 0 gives 


dA 1 dR 1 
— = —|/R x —]| = =||C]| = constant. 
dt 2 
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We conclude that area A(t) is swept out at a constant rate. That is Kepler’s 
second law. 


2.6.2 Kepler’s First Law 


Now we write R(t) = r(t)u(t), where u is a unit vector pointing in the same 
direction as R and r is a positive, scalar-valued function representing the 
length of R. We use Newton’s inverse square law for the attraction of two 
bodies. If one body (the sun) has mass M and the other (the planet) has 
mass m then Newton says that the force exerted by gravity on the planet is 


GmM 
2 


u. 
rc 


Here G is a universal gravitational constant. Refer to Figure 2.10. Because 
this force is also equal to mR” (by Newton’s second law), we conclude that 


MR" = — —— u. 
r 
a GM 
RR" =- 3 U- 
Also q 
R'(t) = a) =rutru 
and 
(22) 2% u) = 2u-u’ 
dt dt - 
Therefore 
ul’. (2.6.2) 


Now, using (2.6.2), and the derivation of C from our discussion of Kepler’s 
second law, we calculate 


R’xC = R"xX(RXRE) 
GM 
= -—yux (ru x (r’u+ru’)) 
GM 


— / 
= Ta ux (ru x ru’) 


= —GM(ux(uxw)). 


We can determine the vector u x (u x u’). For, with formula (2.6.2), we 
see that u and wu’ are perpendicular and that u x u’ is perpendicular to both 
of these. Because u x (u x u’) is perpendicular to the first and last of these 
three, it must therefore be parallel to u’. It also has the same length as u’ 
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GmM 


FIGURE 2.10 
Newton’s universal law of gravitation. 
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FIGURE 2.11 
Calculations with u’. 


and, by the right-hand rule, points in the opposite direction. Look at Figure 
2.11. We conclude that u x (u x u’) = —w’, hence that 
R"xC=GMu'. 
If we antidifferentiate this last equality we obtain 
R'(t) x C= GM(u+Kk), 


where K is a constant vector of integration. 
Thus we have 


R-(Ri(t) x C) = ru(t)- GM(u(t) + K) = GMr(1 + u(t) -K), 


because u(t) is a unit vector. If 0(t) is the angle between u(t) and K then we 
may rewrite our equality as 


R-(R' x C) = GMr(1 + ||K]| cos 6). 
By a standard triple product formula, 
R- (R(t) x C)=(Rx R(t) -C, 


which in turn equals 
C-C=||CIP. 


(Here we have used the fact, which we derived in the proof of Kepler’s second 
law, that R x R’ = C.) 
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Thus 
\|C|/? = G@Mr(1 + ||K|| cos 4). 
(Notice that this equation can be true only if ||K|| < 1—just because we want 
the expression in parentheses to always be nonnegative. This fact will come 


up again below.) 
We conclude that 


got 
GM \1+'||K||cos@/ ° 


This is the polar equation for an ellipse of eccentricity ||K||. (Exercises 4 and 
5 will say a bit more about such polar equations.) 
We have verified Kepler’s first law. 


2.6.3. Kepler’s Third Law 


Look at Figure 2.12. The length 2a of the major axis of our elliptical orbit 
is equal to the maximum value of r plus the minimum value of r. From the 
equation for the ellipse we see that these occur, respectively, when cos @ is +1 
and when cos @ is —1. Thus 


5, - Hel? 4 Wo? EH? 
GM 1-|K] | GM 1+]K] GM(—|IKIP)’ 


We conclude that 


IC] = (a@M(1 — |KI?))””. 


Now recall from our proof of the second law that 
dA 1 
— = —|C|. 
5 = 3Iel 


Then, by antidifferentiating, we find that 


(2.6.3) 


1 
A(t) = 5IIClIt 


(There is no constant term since A(0) = 0.) Let A denote the total area inside 
the elliptical orbit and T the time it takes to sweep out one orbit. Then 


1 
A= A(P) = 5I|CIIT. 


Solving for T’ we obtain 


_ 2A 
ICI] 


But the area inside an ellipse with major axis 2a and minor axis 2b is 


A = nab = na?(1 — e?)*/?, 
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maximum value of r 


mimimum value of r 


FIGURE 2.12 
Analysis of the major axis. 


where e is the eccentricity of the ellipse. This equals 1a?(1 — ||K||?)!/? by 
Kepler’s first law. Therefore 


pe 2r@(1 = KP) 
ICI 


Finally, we may substitute (2.6.3) into this last equation to obtain 


TH Qna3/2 
(GM)!/2 


This is Kepler’s third law. 
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Math Nugget 
Johannes Kepler and the Motion of the Planets 


Johannes Kepler (1571-1630) is said to have been an ener- 
getic and affectionate man. He married his first wife in part 
for her money, and soon realized the error of his ways. When 
she died, he decided to apply scientific methods in the selec- 
tion of a second wife: he carefully analyzed and compared 
the virtues and defects of several ladies before selecting his 
second partner in matrimony. That marriage too was an 
unhappy one. 

Kepler’s scientific career also had its ups and downs. 
His attempt at collaboration with his hero Tycho Brahe fell 
victim to the incompatibility of their strong personalities. In 
his position as Royal Astronomer in Prague, a post which 
he inherited from Tycho Brahe, he was often unpaid. 

It appears that Kepler’s personal frustration, his terrific 
energy, and his scientific genius found a focus in questions 
about planetary motion. Kepler formulated his three laws 
by studying many years’ worth of data about the motion of 
the planets that had been gathered by Tycho Brahe. It is 
amazing that he could stare at hundreds of pages of numer- 
ical data and come up with the three elegant laws that we 
have discussed here. 

Kepler could have simplified his task considerably by using 
the tables of logarithms that John Napier (1550-1617) and 
his assistants were developing at the time. But Kepler could 
not understand Napier’s mathematical justifications for his 
tables, so he refused to use them. 
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Later, Newton conceived the idea that Kepler’s laws could 
be derived, using calculus, from his inverse square law of 
gravitational attraction. In fact it seems clear that this prob- 
lem is one of the main reasons that Newton developed the 
calculus. Newton’s idea was a fantastic insight: that physi- 
cal laws could be derived from a set of physical axioms was 


a new technique in the philosophy of science. On top of this, 
considerable technical proficiency was required to actually 
carry out the program. Newton’s derivation of Kepler’s laws, 
presented here in a modernized and streamlined form, is a 
model for the way that mathematical physics is done today. 


EXAMPLE 2.6.1 The planet Uranus describes an elliptical orbit about the sun. 
It is known that the semimajor axis of this orbit has length 2870 x 10° kilo- 
meters. The gravitational constant is G = 6.637 x 10~8 cm3/(g-sec?). Finally, 
the mass of the sun is 2 x 10° grams. Determine the period of the orbit of 
Uranus. 


Solution: Refer to the explicit formulation of Kepler’s third law that we 
proved above. We have 

T? An? 

a GM 
We must be careful to use consistent units. The gravitational constant G is 
given in terms of grams, centimeters, and seconds. The mass of the sun is in 
grams. We convert the semimajor axis to centimeters: a = 2870 x 10'! cm. 
= 2.87 x 10'4 cm. Then we calculate that 


1/2 
a oe 
GM 
An? 1/2 
= ——— D. 1 14)3 
(ae ae ere »') 


[70.308 x 1017]*/?sec. 
26.516 x 10° sec. 


2 


Notice how the units mesh perfectly so that our answer is in seconds. There 
are 3.16 x 10” seconds in an Earth year. We divide by this number to find 
that the time of one orbit of Uranus is 


T = 83.9 Earth years. | 
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a 


Exercises 


1. 


It is common to take the “mean distance” of a planet from the sun to be 
the length of the semimajor axis of the elliptical orbit. That is because 
this number is the average of the least distance to the sun and the greatest 
distance to the sun. Now you can answer these questions: 


(a) Mercury’s “year” is 88 Earth days. What is Mercury’s mean distance 
from the sun? 


(b) The mean distance of the planet Saturn from the sun is 9.54 as- 
tronomical units.? What is Saturn’s period of revolution about the 
sun? 


Show that the speed v of a planet at any point of its orbit is given by 


v=k(2-2). 
roa 


Suppose that the Earth explodes into fragments which fly off at the same 
speed in different directions into orbits of their own. Use Kepler’s third 
law and the result of Exercise 2 to show that all fragments that do not fall 
into the sun or escape from the solar system will eventually reunite later 
at the same point where they began to diverge (i.e., where the explosion 
took place). 


Kepler’s first law may be written as 
h?/k 
r= ———_... 
1+ ecosé 
Prove this assertion. Kepler’s second law may be written as 
dé 
2 
—=h. 
"at 


Prove this assertion too. Let F be the central attractive force that the sun 
exerts on the planet and F' its magnitude. Now verify these statements: 


(a) Fo =0 
(b) a = *e sind 
dr ke cos0 
a 
mk Mm 
(d) rod =-G re 


Use these facts to prove that a planet of mass m is attracted toward the 
origin (the center of the sun) with a force whose magnitude is inversely 
proportional to the square of r. (Newton’s discovery of this fact caused 
him to formulate his law of universal gravitation and to investigate its 
consequences. ) 


3Here one astronomical unit is the Earth’s mean distance from the sun, which is 
93, 000, 000 miles or 150, 000, 000 kilometers. 
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5. It is common to take h = ||C|| and &k = GM. Kepler’s third law may then 


be formulated as £ ost ‘ 
4n“a*b An < 
pase =(Z)e (*) 


(remember that b? = a?(1 — e?) = a?(1 — ||K||?)). Prove this formula. 


In working with Kepler’s third law, it is customary to measure T in 
Earth years and a in astronomical units (see Exercise 1, the footnote, for 
the definition of this term). With these convenient units of measure, (*) 
takes the simpler form T? = a?. What is the period of revolution T of a 
planet whose mean distance from the sun is 
(a) twice that of the Earth? 

(b) three times that of the Earth? 
(c) 25 times that of the Earth? 

6. Use your symbol manipulation software, such as Maple or Mathematica, 
to calculate the orbit of a planet having mass m about a “sun” of mass 
M, assuming that the planet is given an initial velocity of vo. 


TT 


2.7 Higher-Order Coupled Harmonic Oscillators 


We treat here some aspects of higher-order equations that bear a similarity 
to what we have learned about second-order examples. We shall concentrate 
primarily on linear equations with constant coefficients. As usual, we illustrate 
the ideas with a few key examples. 

We consider an equation of the form 


y”™) + tanger) + oes + ayy) + aoy = f . (2.7.1) 


Here a superscript \ denotes a jth derivative and f is some continuous 
function. This is a linear, constant-coefficient, ordinary differential equation 
of order n. 

Following what we learned about second-order equations, we expect the 
general solution of (2.7.1) to have the form 


Y=Yort Ug, 


where yo is a particular solution of (2.7.1) and y, is the general solution of 
the associated homogeneous equation 


y™ + any? Y +++» tary + apy = 0. (2.7.2) 
Furthermore, we expect that yg will have the form 


Yg = Ary1 + Agy2 +++++ An-1Yn-1 + AnYn ; 
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where the y; are “independent” solutions of (2.7.2) and the A, are arbitrary 
constants. 

We begin by studying the homogeneous equation (2.7.2) and seeking 
the general solution yg. Again following the paradigm that we developed for 
second-order equations, we guess a solution of the form y = e’”. Substituting 
this guess into (2.7.2), we find that 


e"@ : (r+ ayant +artao) = 0. 


Thus we are led to solving the associated polynomial 
r+ dn_1r” 1} + +++ +air+ao = 0. 


The fundamental theorem of algebra tells us that every polynomial of 
degree n has a total of n complex roots 11, r2, ..., Tn (there may be repetitions 
in this list). Thus the polynomial factors as 


(7 —11)- (r= 12)+++ (T= Tn-1) + (7 — Tn) - 


In practice there may be some difficulty in actually finding the complete set 
of roots of a given polynomial. For instance, it is known that for polynomials 
of degree 5 and greater there is no elementary formula for the roots. Let us 
pass over this sticky point for the moment, and continue to comment on the 
theoretical setup. 


I. Distinct Real Roots: For a given associated polynomial, if the polyno- 
mial roots r1,7r2,...,7%n are distinct and real, then we can be sure that 


are n distinct solutions to the differential equation (2.7.2). It then follows, 
just as in the order-2 case, that 


Yo = Aye + Age™ +39: pA, 


is the general solution to (2.7.2) that we seek. 


II. Repeated Real Roots: If the roots are real, but two of them are equal 
(say that r1 = r2), then of course e™!* and e”2” are not distinct solutions of 
the differential equation. Just as in the case of order-2 equations, what we 
do in this case is manufacture two distinct solutions of the form e™!” and 


x-e”, 


More generally, if several of the roots are equal, say rj = r2 = ++: = Tp, 
then we manufacture distinct solutions of the form e™”, x - e™!”, 
fs BOE oo hh agre, 
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III. Complex Roots: We have been assuming that the coefficients of the 
original differential equation ((2.7.1) or (2.7.2)) are all real. This being the 
case, any complex roots of the associated polynomial will occur in conjugate 
pairs a+ib and a—ib. Then we have distinct solutions e(¢+”)* and e(¢~*)*, 
Then we can use Euler’s formula and a little algebra, just as we did in the 
order-2 case, to produce distinct real solutions e*” cos bx and e sin bx. 


In the case that a complex root is repeated to order k, then we take 


e* cos ba, xe cos ba, ..., 2*~ te cos ba 


and 
k-1 ax 


e™ sin bx, xe“ sin bx,..., 2" ~e™ sin bx 
as solutions of the ordinary differential equation. 
EXAMPLE 2.7.3 Find the general solution of the differential equation 
y — 5y® + 4y = 0. 
Solution: The associated polynomial is 
r4—5y?+4=0. 
Of course we may factor this as (r? — 4)(r? — 1) = 0 and then as 


(r— 2)(r+2)(r-1)(r+1) =0. 


We find, therefore, that the general solution of our differential equation is 


y(2) = Aje”* oT Age?” + Age” + Age”. i | 


EXAMPLE 2.7.4 Find the general solution of the differential equation 
y — 8y 4 16y=0. 
Solution: The associated polynomial is 
r* — 8r?+16=0. 
This factors readily as (r? — 4)(r? — 4) = 0, and then as 
(r — 2)?(r +2)? =0. 


Thus the root 2 is repeated and the root —2 is repeated. According to our 
discussion in part IT, the general solution of the differential equation is then 


y(x) = Aye?* + Aore?* + Aze~?* + Age?” | 
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EXAMPLE 2.7.5 Find the general solution of the differential equation 


y — dy) 4 ay — ay + y=0. 


Solution: The associated polynomial is 


r* — Or? + Or? —2r+1=0. 


We notice, just by inspection, that r; = 1 is a solution of this polynomial 
equation. Thus r — 1 divides the polynomial. In fact 


rt — 27? 427? —2r +1 = (r—1)- (8-9? 4+r-1). 


But we again see that rg = 1 is a root of the new third-degree polynomial. 
Dividing out r—1 again, we obtain a quadratic polynomial that we can solve 
directly. 

The end result is 


ri — 2? + Or? —2r 1 = (r= 1)? (rr? +1) =0 


or 
(r —1)?(r —a)(r +4) =0. 


The roots are 1 (repeated), i, and —7. As a result, we find that the general 
solution of the differential equation is 


x) = Aye” + Agre” + Azcosx+ Agsinz. | 
y 


EXAMPLE 2.7.6 Find the general solution of the equation 
y — 5y@ + dy = sine. (2.7.6.1) 


Solution: In fact we found the general solution of the associated homogeneous 
equation in Example 2.7.1. To find a particular solution of (2.7.6.1), we use 
undetermined coefficients and guess a solution of the form y = acos#+( sina. 
A little calculation reveals then that y,(x) = (1/10)sinz is the particular 
solution that we seek. As a result, 


1 
y(x) = jp met Ape? (Ase Oa Age Ave * 


is the general solution of (2.7.6.1). a 
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1 


FIGURE 2.13 
Coupled harmonic oscillators. 


EXAMPLE 2.7.7 (Coupled Harmonic Oscillators) Linear equations of or- 
der greater than 2 arise in physics by the elimination of variables from simulta- 
neous systems of second-order equations. We give here an example that arises 
from coupled harmonic oscillators. Accordingly, let two carts of masses mj, 
mg be attached to left and right walls as in Figure 2.13 with springs having 
spring constants k,, ko. If there is no damping and the carts are unattached, 
then of course when the carts are perturbed we have two separate harmonic 
oscillators. 

But if we connect the carts, with a spring having spring constant k3, then 
we obtain coupled harmonic oscillators. In fact Newton’s second law of motion 
can now be used to show that the motions of the coupled carts will satisfy 
these differential equations: 


dx 
cere => —kyx1 + k3(xe — x1) . 
dx 
2 = = —kow — k3(aq — 21). 


We can solve the first equation for x2, 


1 dx 
v2 = mm («uth + k3] +m 3) 5 


and then substitute into the second equation. The result is a fourth order 
equation for 2x1. | 


rr 


Exercises 


In each of Exercises 1—15, find the general solution of the given differential equation. 
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yl” — 3y" + 2y' =0 
m _ 3y! + dy! — 2y =0 
y" —y=0 
y" +y=0 
yl” + 3y"” + 3y’ +y =0 
y + dy!" + by" + 4y! +y =0 
y —y=0 
y + 5y” + 4y =0 
yO — 2a7y" + a4ty =0 
yO + 2a?y" + aty =0 
yO + y+ 2y” 4+ 2y' +y =0 
yO + 2y'" — dy" — by! + 5y =0 
y” — by” + 11y’ — 6y =0 
yO +y!" — 3y" — Sy’ — 2y =0 


y® — by — 8y'" + 48y" + 16y’ — 96y = 0 


Find the general solution of y“ = 0. Now find the general solution of 
y = sina + 24. 

Find the general solution of y’” — 3y’ + 2y’ = 10 + 42e?”. 

Find the solution of y’” — y’ = 1 that satisfies the initial conditions 
y(0) = y'(0) = y"(0) = 4. 

Show that the change of independent variable given by x = e* transforms 
the third-order Euler equidimensional equation 


30m 


“uy + a2x” y’ +airy’ +aoy =0 


into a third-order linear equation with constant coefficients. Solve the 
following equations by this method. 

(a) xy!" + 3a7y" = 0 

(b) xy my xy" = Qey’ + 2y =0 

(c) xy me Qa? y"" + ay! -y= 0 

In determining the drag on a small sphere moving at a constant speed 
through a viscous fluid, it is necessary to solve the differential equation 


xe y + 8a7y!" + 8ary” — 8y' = Os 


If we make the substitution w = y’, then this becomes a third-order Euler 
equation that we can solve by the method of Exercise 19. Do so, and show 
that the general solution is 


y(x) = cx? +eon b +e3n 2 +e. 


(These ideas are part of the mathematical foundation of the work of 
Robert Millikan in his famous oil-drop experiment of 1909 for measuring 
the charge of an electron. He won the Nobel Prize for this work in 1923.) 


In Example 2.7.7, find the fourth-order differential equation for x1 by 
eliminating x2, as described at the end of the example. 
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22. In Exercise 21, solve the fourth-order equation for x; if the masses are 
equal and the spring constants are equal, so that m1 = mz = m and 
ki = ko = kz = k. In this special case, show directly that x2 satisfies 
the same differential equation as 7,. The two frequencies associated with 
these coupled harmonic oscillators are called the normal frequencies of 
the system. What are they? 


(ie 
Historical Note 


Euler 


Leonhard Euler (1707-1783), a Swiss by birth, was one of the foremost mathe- 
maticians of all time. He was also arguably the most prolific author of all time 
in any field. The publication of Euler’s complete works was begun in 1911 and 
the end is still not in sight. The works were originally planned for 72 volumes, 
but new manuscripts have been discovered and the work continues. 

Euler’s interests were vast, and ranged over all parts of mathematics and 
science. He wrote effortlessly and fluently. When he was stricken with blindness 
at the age of 59, he enlisted the help of assistants to record his thoughts. 
Aided by his powerful memory and fertile imagination, Euler’s output actually 
increased. 

Euler was a native of Basel, Switzerland, and a student of the noted math- 
ematician Johann Bernoulli (mentioned elsewhere in this text). His working 
life was spent as a member of the Academies of Science at Berlin and St. Pe- 
tersburg. He was a man of broad culture, well-versed in the classical languages 
and literatures (he knew the Aeneid by heart), physiology, medicine, botany, 
geography, and the entire body of physical science. 

Euler had 13 children. Even so, his personal life was uneventful and placid. 
It is said that he died while dandling a grandchild on his knee. He never taught, 
but his influence on the teaching of mathematics has been considerable. Eu- 
ler wrote three great treatises: Introductio in Analysin Infinitorum (1748), 
Institutiones Calculi Differentialis (1755), and Institutiones Calculi Integralis 
(1768-1794). These works both assessed and codified the works of all Euler’s 
predecessors, and they contained many of his own ideas as well. It has been 
said that all elementary and advanced calculus textbooks since 1748 are either 
copies of Euler or copies of copies of Euler. 

Among many other important advances, Euler’s work extended and per- 
fected plane and solid analytic geometry, introduced the analytic approach 
to trigonometry, and was responsible for the modern treatment of the func- 
tions Inz and e*. He created a consistent theory of logarithms of negative 
and imaginary numbers, and discovered that ln z has infinitely many values. 
Euler’s work established the use of the symbols e, 7, and i (for /—1). Euler 
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linked these three important numbers together with the remarkable formula 
e™=-1. 


Euler was also the one who established the notation sinx and cos, the use 
of f(a) for an arbitrary function, and the use of )* to denote a series. 

The distinction between pure and applied mathematics did not exist in 
Euler’s day. For him, the entire physical universe was grist for his mill. The 
foundations of classical mechanics were laid by Isaac Newton, but Euler was 
the principal architect of the subject. In his treatise of 1736 he was the first 
to explicitly introduce the concept of a mass-point or particle, and he was 
also the first to study the acceleration of a particle moving along any curve 
and to use the notion of a vector in connection with velocity and accelera- 
tion. Euler’s discoveries were so pervasive that many of them continue to be 
used without any explicit credit to Euler. Among his named contributions are 
Euler’s equations of motion for the rotation of a rigid body, Euler’s hydrody- 
namic equation for the flow of an ideal incompressible fluid, Euler’s law for 
the bending of elastic beams, and Euler’s critical load in the theory of the 
buckling of columns. 

Few mathematicians have had the fluency, the clarity of thought, and the 
profound influence of Leonhard Euler. His ideas are an integral part of modern 
mathematics. 


(I 


2.8 Bessel Functions and the Vibrating Membrane 
Bessel functions arise typically in the solution of Bessel’s differential equation 


xy” + xy’ + (a? — p*)y = 0. 
They are among the most important special functions of mathematical physics. 
In the present discussion we shall explore the use of these functions to describe 
Euler’s analysis of a vibrating circular membrane. The approach is similar to 
that for the vibrating string, which is treated elsewhere in the present chapter 
(Section 2.5). 

We shall be considering a uniform thin sheet of flexible material (polyester, 
perhaps). The sheet will be clamped along a given closed curve (a circle, 
perhaps) in the z-y plane and pulled taut into a state of uniform tension. 
Think, for instance, of a drum. 

When the membrane is displaced slightly from its equilibrium position 
and then released, the restoring forces created by the deformation cause it to 
vibrate. For instance, this is how a drum works. To simplify the mathematics, 
we shall consider only small oscillations of a freely vibrating membrane. 

We shall assume that the membrane lies in the z-y plane and that the 
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FIGURE 2.14 
A segment of the vibrating membrane. 


displacement of the membrane is so small that each point of the surface is 
moved only in the z direction (i.e., perpendicular to the plane of the mem- 
brane). The displacement is given by a function z = z(a, y,t), where t is time. 
We shall consider a small, rectangular piece of the membrane with dimensions 
Az and Ay. The corners of this rectangle are the points (x, y), (a + Az, y), 
(a,y + Ay), and (a + Az,y + Ay). This rectangle, and the portion of the 
displaced membrane that lies above it, are depicted in Figure 2.14. 

If m is the constant mass per unit area of the membrane, then the mass 
of the rectangular piece is m A az A y. Newton’s second law of motion then 
tells us that 

F=mAzdA oe (2.8.1) 
=mAztAys3 8. 
is the force acting on the membrane in the z-direction. 

When the membrane is in equilibrium position, the constant tension T in 
the surface has this physical meaning: Along any small line segment in the 
membrane of length As, the membrane material on one side of the segment 
exerts a force, normal to the segment and of magnitude T A s, on the mem- 
brane material on the other side. In this case, because the membrane is in 
equilibrium, the forces on opposite sides of the segment are both parallel to 
the «-y plane and cancel one another. When the membrane is curved (i.e., 
displaced), however, as in the frozen moment depicted in Figure 2.14, we shall 
assume that the deformation is so small that the tension is still T but now 
acts parallel to the tangent plane, and therefore has a nontrivial vertical com- 
ponent. It is the curvature of the distorted membrane that produces different 
magnitudes for these vertical components on opposite edges, and this in turn 
is the source of the restoring forces that cause the vibrating motion. 
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We analyze these forces by assuming that the piece of the membrane 
indicated in the figure is only slightly tilted. This makes it possible to replace 
the sines of certain small angles by their tangents, thereby simplifying the 
calculations. We proceed as follows. Along the upper and lower edges in the 
figure, the forces are perpendicular to the z-axis and almost parallel to the 
y-axis, with small z-components approximately equal to 


TAx() and -rA+($) . 
Oy ytAy Oy y 


Hence their sum is approximately equal to 


TAs ($) 7 ($) | 
oy ytAy oy y 
The subscripts on these partial derivatives indicate their values at the points 
(x,y + Ay) and (x,y). 


Performing the same type of analysis on the left and right edges, we find 
that the total force in the z-direction—coming from all four edges—is 


raul (5) ae) of eee (CB seay Cf 


As a result, equation (2.8.1) can be rewritten as 


(02/Ox)a4+Ax —(Oz/O%)e | 7,(02/OY)y+ay — (02z/Oy)y _ Pz 
peo e So foes Ea ee ge ~ TG 
If we now set a? = T/m and let Ax — 0, Ay — 0, then we find that 
Orz  ORz Orz 
Die Ol sey FOREN. Ose. 
a e& + xa) FR (2.8.2) 


this is the two-dimensional wave equation. We note that this is a partial dif- 
ferential equation: it involves functions of three variables, and of course partial 
derivatives. 

Now we shall consider the displacement of a circular membrane. So our 
study will be the model for a drum. Of course we shall use polar coordinates, 
with the origin at the center of the drum. The wave equation now has the 


form 
gf Oe: VOg-, A 0Fa\, 07% 
& oe am) = 3R° 
Here, naturally, z = z(r,@,t) is a function of the polar coordinates r,@ and of 
time t. We assume, without loss of generality, that the membrane has radius 
1. Thus it is clamped to its plane of equilibrium along the circle r = 1 in the 
polar coordinates. Thus our boundary condition is 


(2.8.3) 


z(1,0,t) =0, (2.8.4) 
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because the height of the membrane at the edge of the disc-shaped displaced 
region is 0. The problem, then, is to find a solution of (2.8.3) that satisfies 
the boundary condition (2.8.4) together with certain initial conditions that 
we shall not consider at the moment. 
We shall apply the method of separation of variables. Thus we seek a 
solution of the form 
z(r,0,t) = u(r)v(0) w(t) . (2.8.5) 


We substitute (2.8.5) into (2.8.3) and perform a little algebra to obtain 


wir) ivr), 1%) _ 1" 
u(r) r u(r) r2 v(6) ~ @2 w(t) (2.8.6) 


Now our analysis follows familiar lines: Since the left-hand side of (2.8.6) 
depends on r and @ only and the right-hand side depends on t¢ only, we conclude 
that both sides are equal to some constant K. In order for the membrane to 
vibrate, w(t) must be periodic. Thus the constant K must be negative (if it 
is positive, then the solutions of w” — Ka?w = 0 will be real exponentials and 
hence not periodic). We thus equate both sides of (2.8.6) with K = —? for 
some » > 0 and obtain the two ordinary differential equations 


w’(t) + a?w(t) = 0 (2.8.7) 

and 
u(r) _ lu(r) | 1") 
u(r) ru(r)  r? v(@) 


Now (2.8.7) is easy to solve, and its general solution is 


=—)’, (2.8.8) 


w(t) = c1 cos Aat + cz sin Aat. (2.8.9) 
We can rewrite (2.8.8) as 
Ae / MN 
u(r) u(r) 2,2 v" (9) 
=-— : 2.8.1 
wry age) 19 6) an 


Notice that in equation (2.8.10) we have a function of r only on the left and 
a function of 6 only on the right. So, as usual, both sides must be equal to 
a constant L. Now we know, by the physical nature of our problem, that vu 
must be 27-periodic. Looking at the right-hand side of (2.8.10) then tells us 
that L = n? for n € {0,1,2,...}. 

With these thoughts in mind, equation (2.8.10) splits into 


v" (0) + n7v(0) =0 (2.8.11) 


and 
ru (r) + ru'(r) + (?r? — n?)u(r) = 0. (2.8.12) 


Of course equation (2.8.11) has, as its general solution, 


v(0) = dy cosné + dz sinné. 
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(Note that this solution is not valid when n = 0. But, when n = 0 the 
equation has no nontrivial periodic solutions.) Also observe that equation 
(2.8.12) is a slight variant of Bessel’s equation (in fact a change of variables of 
the form r = w’, u = v- w°, for appropriately chosen b and c, will transform 
the Bessel’s equation given at the beginning of this discussion to equation 
(2.8.12)). It turns out that, according to physical considerations, the solution 
that we want of equation (2.8.12) is 


u(r) =k- JIn(r). 


Here k is a physical constant and J, is the nth Bessel function, discussed in 
detail in Chapters 3 and 4 below. Note for now that the Bessel functions are 
transcendental functions, best described with power series. 

Let us conclude this discussion by considering the boundary condition 
(2.8.4) for our problem. It can now be written simply as u(1) = 0 or 


FA) =0: 


Thus the permissible values of \ are the positive zeros of the Bessel function 
Jn (see also the discussion in Chapter 3 below). It is known that there are 
infinitely many such zeros, and they have been studied intensively over the 
years. The reference [WAT] is a great source of information about Bessel 
functions. 


a 


Problems for Review and Discovery 


A. Drill Exercises 


1. Find the general solution of each of these differential equations. 


(a) y” —3y’+y=0 
(b) y”+y’+y=0 
(c) y’ +6y’ +9y =0 
(d) y’—y'+6y=0 
(e) y” — 2y' -—5y=2 
(f) y’ +y=e" 

(g) y’+y'+y=sine 


(h) y”-y=e* 
2. Solve each of the following initial value problems. 
(a) y"+9y=0, y(0)=1, y/(0) =2 
(b) y"-y'+4y=2, yl) =2, (0) =1 
(c) y+ 2y'+5y=e", y(0)=—-1, y'(0)=1 
(d) y’+3y'+4y=sinz, y(m/2)=1, y'(a/2) =—-1 
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(e) yty=e"%, y(2)=0, y'(2) 

(f) y"—y=cosz, y(0)=3, y'(0)=2 

(g) y” =tane, y(1)=1, y(1)=-1 

(h) y”—2y’=Inz, y(1)=e, y/(1) =1/e 

Solve each of these differential equations. 

(a) y’ +3y' +2y = 22-1 

(b) y” — 3y' + 2y=e™ 

(c) y’ —y' —2y=cosx 

(d) y 

(e) y 
y 
x£ 


"4 9y' —y = ve* sing 

"4 Oy = sec 2x 

(f) y’ +4y' +4y=alne2 

(g) 2?y" + 3ay' +y = 2/x 

(h) y” +4y = tan? x 

Use the given solution of the differential equation to find the general 
solution. 

(a) y”—y=3e", w(x) =e 
(b) y”+y=—8sin3zr, yi(x) =sin3x 

(c) y +y'ty=2? +2042, y(e)=2? 
(d) y’+y¥ =, w(x) =n 


22 


B. Challenge Exercises 


1. 


Consider the differential equation y” + 4y = 0. Convert it to a system of 
first-order, linear ordinary differential equations by setting v = y’. Hence 


we have 
/ 
= Vv 
/ 


v = —Ay 


Find solutions y(x), v(x) for this system. If we think of x as a parameter, 
then the map 

t+— (y(x), v(x) 
describes a curve in the plane. Draw this curve for a variety of different 
sets of initial conditions. What sort of curve is it? 
Explain why yi(a) = sina and y2(x) = 2x cannot both be solutions of 
the same ordinary differential equation 


y" = F(x,y,y') 
for a smooth F’. 
Show that the Euler equation 


ay” — Qaxy' + 2y =0 
with initial conditions 
y(0) =0, y'(0) =0 


has infinitely many solutions. Why does this surprising result not contra- 
dict the general ideas that we learned in this chapter? 


124 PROBLEMS FOR REVIEW AND DISCOVERY 
4. Does the differential equation 


y +9y = —3cos 2a 


have any periodic solutions? Say exactly what you mean by “periodic” 
as you explain your answer. 


C. Problems for Discussion and Exploration 


1. Show that the ordinary differential equation y’ + y = cosa has a unique 
periodic solution. 


2. Find the regions where the solution of the initial value problem 


y” =-8y, y(0)=—-1 


is concave down. In what regions is it concave up? What do these prop- 
erties tell us about the solution? Do you need to actually solve the dif- 
ferential equation in order to answer this question? 


3. Consider solutions of the differential equation 


dy dy 

da? “de +¥=9 
for a constant c. Describe how the behavior of this solution changes as c 
varies. 


4. Endeavor to find an approximate solution to the differential equation 


a 
— +siny =0 

by guessing that the solution is a polynomial. Think of this polynomial 
as the Taylor polynomial of the actual solution. Can you say anything 
about how accurately your polynomial solution approximates the true 


solution? 


3 


Power Series Solutions and Special 
Functions 


e Power series basics 

e Convergence of power series 

e Series solutions of first-order equations 

e Series solutions of second-order equations 
e Ordinary points 

e Regular singular points 


e Frobenius’s method 


rr 


3.1 Introduction and Review of Power Series 


It is useful to classify the functions that we know, or will soon know, in an 
informal way. The polynomials are functions of the form 


ao + a,x + aga? ee apa"! + apa* 3 


where do, @1,.-.,@, are constants. This is a polynomial of degree k. A rational 

function is a quotient of polynomials. For example, 

(x) 3a3 —x +4 

r(z) = —————_ 
z2+5x2+1 


is a rational function. 

A transcendental function is one that is not a polynomial or a rational 
function or a root. The elementary transcendental functions are the ones that 
we encounter in calculus class: sine, cosine, logarithm, exponential, and their 
inverses and combinations using arithmetic/algebraic operations. 

The higher transcendental functions are ones that are not elementary and 
are defined using power series (although they often arise by way of integrals or 
asymptotic expansions or other means). These often are discovered as solutions 
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of differential equations. These functions are a bit difficult to understand, just 
because they are not given by elementary formulas. But they are frequently 
very important because they come from fundamental problems of mathemati- 
cal physics. As an example, solutions of Bessel’s equation, which we saw at the 
end of the last chapter, are called Bessel functions and are studied intensively 
(see [WAT)]). 

Higher transcendental functions are frequently termed special functions. 
These important functions were studied extensively in the eighteenth and 
nineteenth centuries—by Gauss, Euler, Abel, Jacobi, Weierstrass, Riemann, 
Hermite, Poincaré, and other leading mathematicians of the day. Although 
many of the functions that they studied were quite recondite, and are no 
longer of much interest today, others (such as the Riemann zeta function, the 
gamma function, and elliptic functions) are still intensively studied. 

In the present chapter we shall learn to solve differential equations using 
the method of power series, and we shall have a very brief introduction to how 
special functions arise from this process. It is a long chapter, with a number 
of new ideas. But there are many rewards along the way. 


3.1.1 Review of Power Series 


We begin our study with a quick review of the key ideas from the theory of 
power series. 


I. A series of the form 


Co 
Soa =aop tau tagu74+--- (3.1.1) 
j=0 


is called a power series in x. Slightly more general is the series 


Co 


a(x — a) ; 


j=0 


which is a power series in x — a (or expanded about the point a). 


II. The series (3.1.1) is said to converge at a point «x if the limit 


N 

lim a;x? 
N-w4s 

j=0 
exists. The value of the limit is called the sum of the series. (This is just the 
familiar idea of defining the value of a series to be the limit of its partial 
sums.) 

Obviously (3.1.1) converges when x = 0, since all terms but the first (or 
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zeroeth) will then be equal to 0. The following three examples illustrate, in 


an informal way, what the convergence properties might be at other values of 
x. 


(a) The series 


Se =i +a+ Qe? + 3lae +--- 
g=0 


diverges! at every x # 0. This can be seen by using the ratio test from the 
theory of series. It of course converges at x = 0. 


(b) The series 


ou 2 3 


ie i L | 
7 T Toy Tap tC 
reel 2! 3! 


converges at every value of x, including x = 0. This can be seen by applying 
the ratio test from the theory of series. 


(c) The series 


co 
Soci =ltrta? torte. 
j=0 


converges when || < 1 and diverges when |x| > 1. 


These three examples are special instances of a general phenomenon that 
governs the convergence behavior of power series. There will always be a num- 
ber R, 0 < R < c, such that the series converges for |x| < R and diverges for 
|x| > R. In the first example, R = 0; in the second example, R = +00; in the 
third example, R = 1. We call R the radius of convergence of the power series. 
The interval (—R, R) is called the interval of convergence. In practice, we check 
convergence at the endpoints of the interval of convergence by hand in each 
example. We add those points to the interval of convergence as appropriate. 
The next three examples will illustrate how we calculate R in practice. 


EXAMPLE 3.1.2 Calculate the interval of convergence of the series 


oe F 
gd 


j=0 4 


Solution: We apply the ratio test: 
erg ely 
ad [5° 


1Here we use the notation n! = n-(n—1)-(n—2)---3-2-1. This is called the factorial 
notation. By convention, 1! = 1 and 0! = 1. 


“2 
lim i oo er . 
joo (J + 1)? 


joo 


x) = ||. 
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We know that the series will converge when this limit is less than 1, or |z| < 1. 
Likewise, it diverges when |x| > 1. Thus the radius of convergence is R = 1. 
In practice, one has to check the endpoints of the interval of convergence 
by hand for each case. In this example, we see immediately that the series 
converges at « = +1. Thus we may say that the interval of convergence is 
[—1, 1]. a 


EXAMPLE 3.1.3 Calculate the interval of convergence of the series 


hae 
se 


j=0 J 
Solution: We apply the ratio test: 
ITV (Gy 
lim |= [9 +1) =/lim - -x| = |a|. 
jroo} ad /j joo j+l 


We know that the series will converge when this limit is less than 1, or |z| < 1. 
Likewise, it diverges when |z| > 1. Thus the radius of convergence is R = 1. 
In this example, we see immediately that the series converges at —1 (by 
the alternating series test) and diverges at +1 (since this gives the harmonic 
series). Thus we may say that the interval of convergence is [—1, 1). a 


EXAMPLE 3.1.4 Calculate the interval of convergence of the series 


[oe) 


gd 
ea 
j=0 7 
Solution: We use the root test: 
i 1/5 
j 
iim |=] = lim |=} =0. 
a bl peo 


Of course 0 < 1, regardless of the value of x. So the series converges for all x. 
The radius of convergence is +00 and the interval of convergence is (—oo, +00). 
There is no need to check the endpoints of the interval of convergence, because 
there are none. i) 


III. Suppose that our power series (3.1.1) converges for |z| < R with R> 0. 
Denote its sum by f(x), so 


fle) = Sajal = aq taye +aga? +++. (3.1.2) 
j=0 
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Thus the power series defines a function, and we may consider differentiating 
it. In fact the function f is continuous and has derivatives of all orders. We 
may calculate the derivatives by differentiating the series termwise: 


f'(2)= SS jae = a1 + 2agr + 3a3z7 +---, (3.1.3) 
j=l 

f'(2)= SIG —1)z7~? = 2a9 +3-2agr+---, (3.1.4) 
j=2 


and so forth. Each of these series converges on the same interval || < R. 
Observe that, if we evaluate (3.1.2) at « = 0, then we learn that 


ao = f (0). 


If instead we evaluate the series (3.1.3) at x = 0, then we obtain the useful 
fact that 


_ £0) 
ay = th 
If we evaluate the series (3.1.4) at « = 0, then we obtain the analogous fact 
that (0) 
apn 
ag = oy . 


Here the superscript () denotes a second derivative. 
In general, we can derive (by successive differentiations) the formula 
(0 
ce io : a‘ , (3.1.5) 
J! 


which gives us an explicit way to determine the coefficients of the power series 
expansion of a function. It follows from these considerations that a power 
series is identically equal to 0 if and only if each of its coefficients is 0. 

We may also note that a power series may be integrated termwise. If 


co 
f(x) = So ajx? = ag +aya tage? +---, 
j=0 
then 2 
gith ee es 
| Hede= Yeas = ant a> +a2y te +. 


If 


co 
F(x) = So aja? = ap + aye + ane? +--+ 
j=0 
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and 
lo) 
g(x) = S© dja? = bo + bia + bow? ++ 
j=0 
for |z| < R, then these functions may be added or subtracted by adding or 
subtracting the series termwise: 


f(a) £ g(x) = S— (a; + b;)a7 = (ao £ bo) + (a1 + bia + (ag + ba)? +--+. 
Also f and g may be multiplied as if they were polynomials, to wit 
F(x) g(a) = So ena”, 
j=0 


where 
Cn = aobn a aybn—1 spe p Anbo : 


We shall say more about operations on power series below. 

Finally, we note that if two different power series converge to the same 
function, then (3.1.5) tells us that the two series are precisely the same (i.e., 
have the same coefficients). In particular, if f(#) = 0 for |z| < R then all the 
coefficients of the power series expansion for f are equal to 0. 


IV. Suppose that f is a function that has derivatives of all orders on |x| < R. 
We may calculate the coefficients 
f(0) 


j! 


aj= 


and then write the (formal) series 
So aja! . (3.1.6) 
j=0 


It is then natural to ask whether the series (3.1.6) converges to f. When 
the function f is sine or cosine or logarithm or the exponential then the 
answer is “yes.” But these are very special functions. Actually, the answer 
to our question is generically “no.” Most infinitely differentiable functions do 
not have power series expansion that converges back to the function. In fact 
most have power series that do not converge at all; but even in the unlikely 
circumstance that the series does converge, it will generally not converge to 
the original function f. 

This circumstance may seem rather strange, but it explains why mathe- 
maticians spent so many years trying to understand power series. The func- 
tions that do have convergent power series are called real analytic and they are 
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very particular functions with remarkable properties. Even though the subject 
of real analytic functions is more than 300 years old, the first and only book 
written on the subject is [KRP1]. 

We do have a way of coming to grips with the unfortunate state of affairs 
that has just been described, and that is the theory of Taylor expansions. For 
a function with (n + 1) continuous derivatives, here is what is actually true: 


r e(Q) 
f@=>> Pete Rn(a) , (3.1.7) 


| 
jo 
where the remainder term R,,(x) is given by 


fone), 
ne) Grae 


for some number € between 0 and x. The power series converges to f precisely 
when the partial sums in (3.1.7) converge, and that happens precisely when 
the remainder term goes to zero. What is important for you to understand is 
that, generically, the remainder term does not go to zero. But formula (3.1.7) 
is still valid. 

We can use formula (3.1.7) to obtain the familiar power series expansions 
for several important functions: 


OO j 2 3 


i aI x ar 
7 ig pera ig te 
j=0 
= grit ee 
a — _1)I — poms es, Se oer 
pene 2 Gap att ae 
= . gd x? xt 
— —1)I = =, ae ts state 
cosx = 2 1) ania} Bl dl + 


Of course there are many others, including the logarithm and the other 
trigonometric functions. Just for practice, let us verify that the first of these 
formulas is actually valid. 

First, 


—e" =e for every j. 


Thus ; ; 
(d? /dx?)e*| _o - 1 

j! i 
This confirms that the formal power series for e” is just what we assert it 
to be. To check that it converges back to e”, we must look at the remainder 
term, which is 


J 


fEN® n+1 _ eae 


Bale) py eee 
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Of course, for x fixed, we have that |&| < |x|; also n — oo implies that 
(n+1)! — co much faster than x”"*!. So the remainder term goes to zero and 
the series converges to e”. 


V. Operations on Series 


Some operations on series, such as addition, subtraction, and scalar multipli- 
cation, are straightforward. Others, such as multiplication, entail subtleties. 


Products of Series 


In order to keep our discussion of multiplication of series as straightforward 
as possible, we deal at first with absolutely convergent series. It is convenient 
in this discussion to begin our sum at j = 0 instead of 7 = 1. If we wish to 


multiply 
S- a; and S- b;, 
j=0 j=0 


then we need to specify what the partial sums of the product series should 
be. An obvious necessary condition that we wish to impose is that if the first 
series converges to a and the second converges to 3 then the product series 
0 c;, whatever we define it to be, should converge to a - (3. 


The Cauchy Product 


Cauchy’s idea was that the summands for the product series should be 


m 
Cm = ) a;  Om—j- 
j=0 


This particular form for the summands can be easily motivated using polyno- 
mial considerations (which we shall provide later on). For now we concentrate 
on confirming that this “Cauchy product” of two series really works. 
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Theorem 3.1.5 Let ))j" a; and 0j-, 6; be two abso- 
lutely convergent series which converge to limits a and £, 
respectively. Define the series ea Cm With summands 


m 
Cm = ; a; bm—j- 
j=0 


Then the series \>**_. Gm converges to a: 3. 


EXAMPLE 3.1.6 Consider the Cauchy product of the two conditionally con- 
vergent series 


Observe that 


2 et eae 
- Jivmyi JRA 
cyncye 
Vvm+iv1 


However, 
(G+1)-(m4+1-9) < (m4+1)-(m+1) = (m4 1)’. 
Thus 
| 
“|S — =1. 
le 2 aa 


We thus see that the terms of the series }>*°_9 Cm do not tend to zero, so the 
series cannot converge. | 


EXAMPLE 3.1.7 The series 


Co 


A= S22 and B= 5°34 
j=0 j=0 


are both absolutely convergent. We challenge the reader to calculate the 
Cauchy product and to verify that that product converges to 3. | 
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VI. We conclude by summarizing some properties of real analytic functions. 


. Polynomials and the functions e”, sinz, cosa are all real analytic at all 
points. 


. If f and g are real analytic at vp then f +g, f-g, and f/g (provided 
g(ao) #0) are real analytic at xo. 


. If f is real analytic at vo and if f~! is a continuous inverse on an interval 
containing f(xo) and f’(zo) #0, then f~! is real analytic at f (zo). 


. If g is real analytic at xo and f is real analytic at g(xo), then f og is real 
analytic at x. 


. A function defined by the sum of a power series is real analytic at all interior 
points of its interval of convergence. 


VII. It is worth recording that all the basic properties of real power series 
that we have discussed above are still valid for complex power series. Such a 
series has the form 


Co Co 

aod ; J 
y C5 or y cj(z— a)’, 
j=0 j=0 


where the coefficients c; are real or complex numbers, a is a complex number, 
and z is a complex variable. The series has a radius of convergence R, and 
the domain of convergence is now a disc D(0,R) or D(a, R) rather than an 
interval. In practice, a complex analytic function has radius of convergence 
about a point a that is equal to the distance of the nearest singularity to a. 
See [KNO] or [GRK] or [KRP1] for further discussion of these ideas. 


Exercises 


1. Use the ratio test (for example) to verify that R = 0, R= oo, and R=1 
for the series (a), (b), (c) that are discussed in the text. 


2. Ifp+#0 and p is not a positive integer then show that the series 
5 Oe) ai 


! 
j=l J 


converges for |x| < 1 and diverges for |x| > 1. 
3. Verify that R = +00 for the power series expansions of sine and cosine. 


3.2. 


4. 


5. 
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Use Taylor’s formula to confirm the validity of the power series expansions 
for In(1+ 2), sinz, and cosz. 
When we first encounter geometric series we learn that 


2 n 1-2" 
1-2 


provided x # 1. Indeed, let S be the expression on the left and consider 
(1—«)-S. Use this formula to show that the expansions 


: =l+e4a?+4e74 
I a a 
and 
1 2_ 3 
=l-@+2 eo +-:- 
1l+2a 
are valid for |x| < 1. Use these formulas to show that 
2 3 4 
In(l +2) =2 > 5 — 
and : 
arctan xz = x ca - = 
ae es VES 
for |a| <1. 


Use the first expansion given in Exercise 5 to find the power series for 
1/(1— 2)? 

(a) by squaring; 

(b) by differentiating. 

(a) Show that the series 


satisfies y! = —y. 
(b) Show that the series 


MSE oe orgs geauaeGe © 
converges for all x. Verify that it defines a solution of the equation 
ry +y' +ay=0. 


This function is the Bessel function of order 0. It will be encountered 
later in other contexts. 
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3.2 Series Solutions of First-Order Differential Equa- 
tions 


Now we get our feet wet and use power series to solve first-order linear equa- 
tions. This will turn out to be misleadingly straightforward to do, but it will 
show us the basic moves. 

At first we consider the equation 


y =y. 


Of course we know that the solution to this equation is y = C - e”, but 
let us pretend that we do not know this. We proceed by guessing that the 
equation has a solution given by a power series, and we proceed to solve for 
the coefficients of that power series. 

So our guess is a solution of the form 


y= ag tayx+agx? +agz°4+---. 


Then 


y’ =a, 4+ 2age 4 3agx7 ++:: 


and we may substitute these two expressions into the differential equation. 
Thus 
ay + 2agxe + 30327 +---=antaixt+agr? +--- : 


Now the powers of x must match up (i.e., the coefficients must be equal). 
We conclude that 


ay, = ao 
2a2 = at 
343 = a2 


and so forth. Let us take ag to be an unknown constant C. Then we see that 


ay => C 
Ps C¢ 
ov = 9 
7 C 
CO? - a0 
etc. 
In general, 
C 
an = —. 
n! 
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In summary, our power series solution of the original differential equation 
is 


[oe Co C [oe 
1=) ae = =C aac e” 
j=0 Fao g20°4 


Thus we have a new way, using power series, of discovering the general solu- 
tion of the differential equation y’ = y. 


The next example illustrates the point that, by running our logic a bit dif- 
ferently, we can use a differential equation to derive the power series expansion 
for a given function. 


EXAMPLE 3.2.1 Let p be an arbitrary real constant. Use a differential equation 
to derive the power series expansion for the function 


y=(14+2)?. 
Solution: Of course the given y is a solution of the initial value problem 
(l+z)-y=py, y(0)=1. 
We assume that the equation has a power series solution 
eas « 
y= >o aja! = ao + att az? +--- 
j=0 


with positive radius of convergence R. Then 


CO 


y = ae 7 aja =a, + 2agqz 4 3a3x7 forse 
j=l 
loc) 

wy! = bee -ajx) = aya + 2agr? + 3agr° +--- 
j=l 


Co 
py = >_ pasa? = pag + pax + page” +--- 
j=0 


We rewrite the differential equation as 
/ Te =: 
y try = py. 


Now we see that the equation tells us that the sum of the first two of our 
series equals the third. Thus 


loc) CO [oe) 
jac + ja;xd = ax) . 
j j j 
j=l j=l j=0 
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We immediately see two interesting anomalies: the powers of x on the left- 
hand side do not match up, so the two series cannot be immediately added. 
Also the summations do not all begin in the same place. We address these two 
concerns as follows. 

First, we can change the index of summation in the first sum on the left 
to obtain 


CoO 
+0 + ljajpia? + ee Dima 
j=0 j=l 


Write out the first few terms of the changed sum, ~ the original sum, to see 
that they are just the same. 

Now every one of our series has x/ in it, but they begin at different places. 
So we break off the extra terms as follows: 


co co 
sl jt lajyia! + 5 joe SS paja! = —a,2° + pagx®. (3.2.1.1) 
j=l j=l 


Notice that all we have done is to break off the zeroeth terms of the first and 
third series, and put them on the right. 

The three series on the left-hand side of (3.2.1.1) are begging to be put 
together: they have the same form, they all involve matching powers of x, and 
they all begin at the same index. Let us do so: 


co 
[Gi + l)aj41 + ja; — pay)x? = —a1 + pao. 
j=l 
Now the powers of x that appear on the left are 1, 2,..., and there are none of 
these on the right. The right-hand side only contains x to the zeroeth power. 
We conclude that each of the coefficients on the left is zero; by the same 
reasoning, the coefficient (—a, + pao) on the right (i.e., the constant term) 
equals zero. Here are we are using the uniqueness property of the power series 
representation. It gives us infinitely many equations. 
So we have the equations?” 


—a,+pag = 0 
(G+ Laji+G-—pja; = 0 forj=1,2,.... 


Our initial condition y(0) = 1 tells us that a9 = 1. Then our first equation 
implies that a, = p. The next equation, with 7 = 1, says that 


2a. + (1—p)a, =0. 


Hence 


2A set of equations like this is called a recursion. It expresses later indexed ajs in terms 
of earlier indexed aj;s. 
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Continuing, we take 7 = 2 in the second equation to get 
3a3 + (2 — p)az =0 


sO 


peed eee J Ra a 


3 3-2 
We may continue in this manner to obtain that 


= pe ip= 2a 9 1) 
J 4! 


Thus the power series expansion for our solution y is 


y= itp MPa, HO=DP-2) 
ee 


Since we knew in advance that the solution of our initial value problem was 
y=(1+2)? 


(and this function is certainly analytic at 0), we find that we have derived 
Isaac Newton’s general binomial theorem (or binomial series): 


(L+a)P=1+pe+ MOOV,» Pen Dp) 
Ae 
eee j (3.2.1.2) 
j=0 


a 


Exercises 


1. For each of the following differential equations, find a power series so- 
lution of the form yy aj;x). Endeavor to recognize this solution as the 
series expansion of a familiar function. Now solve the equation directly, 
using a method from the earlier part of the book, to confirm your series 
solution. 


140 
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(a) y' = 2ay 

(b) y'+y=1 
(c) y-—y=2 
(d) y’+y=0 
(e) y’—-y=0 
(f) y-y=2" 


For each of the following differential equations, find a power series so- 
lution of the form a;x’. Then solve the equation directly. Compare 
your two answers, and explain any discrepancies that may arise. 


(a) wy’ =y 

(b) ay’ =y 

(c) y! — (1/a)y = 2? 

(4) y+ Q/e)y=2 | 
Express the function arcsinx in the form of a power series »; aj;x’? by 
solving the differential equation y’ = (1 — x)? in two different ways. 
[Hint: Remember the binomial series.] Use this result to obtain the for- 
mula 

dp. J eh pS 1 1-3-5 1 
2 3-23 2-4 5-25 2-4-6 7-27 


The differential equations considered in the text and in the preceding 
exercises are all linear. By contrast, the equation 


y =1+y? (*) 


is nonlinear. One may verify directly that y(x) = tan is the particular 
solution for which y(0) = 0. Show that 


tanx =ax2+ De: ze 4. 
7 3 15 


by assuming a solution for equation (*) in the form of a power series 
Dy a;x) and then finding the coefficients a; in two ways: 


(a) by the method of the examples in the text (note particularly how 
the nonlinearity of the equation complicates the formulas); 
(b) by differentiating equation (*) repeatedly to obtain 


y= 2Qyy’, yl” = yy" +2(y'), ete. 


and using the formula a; = f (0) /j!. 

Solve the equation 
y=x-y, y(0)=0 

by each of the methods suggested in the last exercise. What familiar 
function does the resulting series represent? Verify your conclusion by 
solving the equation directly as a first-order linear equation. 
Use your symbol manipulation software, such as Maple or Mathematica, 
to write a routine that will find the coefficients of the power series solution 
for a given first-order ordinary differential equation. 
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3.3. Second-Order Linear Equations: Ordinary Points 


We have invested considerable effort in studying equations of the form 
y +p-y t+q-y=0. (3.3.1) 


Here p and q could be constants or, more generally, functions. In some sense, 
our investigations thus far have been misleading; for we have only considered 
particular equations in which a closed-form solution can be found. These cases 
are really the exception rather than the rule. For most such equations, there is 
no “formula” for the solution. Power series then give us some extra flexibility. 
Now we may seek a power series solution; that solution is valid, and may be 
calculated and used in applications, even though it may not be expressed in 
a compact formula. 

A number of the differential equations that arise in mathematical 
physics—Bessel’s equation, Lagrange’s equation, and many others—in fact 
fit the description that we have just presented. So it is worthwhile to develop 
techniques for studying (3.3.1). In the present section we shall concentrate 
on finding a power series solution to equation (3.3.1)—written in standard 
form—expanded about a point 29, where zo has the property that p and q 
have convergent power series expansions about x. In this circumstance we 
call 9 an ordinary point of the differential equation. Later sections will treat 
the situation where either p or q (or both) has a singularity at xo. 

We begin our study with a familiar equation, just to see the basic steps, 
and how the solution will play out.? Namely, we endeavor to solve 


y"+y=0 
by power series methods. As usual, we guess a solution of the form 
y= > aja! = ap tax + az? +--- 
j=0 


Of course it follows that 


foe) 
y = y jajv!~* = ay + 2agr + 3agx” +--- 
and 


= S059 - lajat- 2 9-1-a9+3-2-agr+4-3- asx" 
g=2 


3Of course this is an equation that we know how to solve by other means. Now we are 
learning a new solution technique. 
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Plugging the first and third of these into the differential equation gives 


So iG —l)ajx~? + Saja? =0. 
j=2 j=0 


As in the last example of Section 3.2, we find that the series that occur have x 
raised to different powers, and that the summations begin in different places. 
We follow the standard procedure for repairing these matters. 

First, we change the index of summation in the second series. So 


SIG - L)a;a)~? + S- aj-22)* = 
j=2 j=2 


We invite the reader to verify that the new second series is just the same as 
the original second series (merely write out a few terms of each to check). We 
are fortunate in that both series now begin at the same index, and they both 
involve the same powers of x. So we may add them together to obtain 


foe) 
S~ Lil (j — La; + a;- ala? = 0. 
j=2 


The only way that such a power series can be identically zero is if each of 
the coefficients is zero. So we obtain the recursion equations 


9g —leptope=0, F= 2,8 ,4).005 


Then j = 2 gives us 


ag = 2-1 . 
It will be convenient to take ag to be an arbitrary constant A, so that 
_ A 
ag = 2-1 : 


The recursion for 7 = 4 says that 


a2 A 
4-3. 4-3-2-1° 


ag = — 


Continuing in this manner, we find that 


= 1) A 
ay = (WY on) ay) Dd 
gi A = 


Thus we have complete information about the coefficients with even index. 
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Now let us consider the odd indices. Look at the recursion for 7 = 3. This 
is i 
3-20 


It is convenient to take a, to be an arbitrary constant B. Thus 


a3 = 


a5 = - > 


In general, 
a2;41 = (—1)4 = ow he 
2j+1 — (27 41)!’ JF 14,00 
In summary, then, the general solution of our differential equation is given 
by 


y=. (Sy : ont”) +B. (iy ea”) , 


Of course we recognize the first power series as the cosine function and the 
second as the sine function. So we have rediscovered that the general solution 
of y”+y =0is 

y=A-cost+B-sina. 
This is consistent with what we learned earlier in the book about solving the 
equation y” + y = 0. ia 


EXAMPLE 3.3.1 Use the method of power series to solve the differential equa- 
tion 
(1 — 2? )y” — 2ay’ + p(p+ 1)y =0. G31.) 


Here p is an arbitrary real constant. This is called Legendre’s equation. 
Solution: First we write the equation in standard form: 


2x +1 
yl!” 2 y mA p(p ) 


=; 
=a? Lage" 


Observe that, near x = 0, division by 0 is avoided and the coefficients p and 
q are real analytic. So 0 is an ordinary point. 
We therefore guess a solution of the form 


Co 
y= y ajxi =ag + a,x +agr7+--: 
j=0 
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and calculate 


lo) 
a Si jeje * = a1 + 2agx + 3032? +--- 
j=l 
and 
co 


= So ij Maja? = 2a2 + 3-+2-agx+---. 


It is most convenient to treat the differential equation in the form (3.3.1.1). 
We calculate 


2? y" = — 5G — Naja! 


and ie 
—2ey' = — S- Qjajx? : 


Substituting into the differential equation now yields 
lo) lo) co co 
S25 — Yaga’? — $7 5G — Yaya? — S 0 2jajei + $7 p(y + Yaya? = 0. 
j=2 j=2 j=l j=0 


We adjust the index of summation in the first sum so that it contains x rather 
than 2~? and we break off spare terms and collect them. The result is 


(G+2)G + l)ajtex? — Dit (j —l)aja? — Sia 


we 


& 
i 
wo 


+50 p(pt aja! (20: + 6a3x — 2a,x 
j=2 


+p(p + 1)ao + p(p + jae) =0. 


In other words, 


Y(u+D0 + 1)aj+2 — j(9 — 1)aj — 25a; + p(p4 1a) 
j=2 
+ (2 + p(p+ 1)a0 + (0. — 2a, + p(p+ 1)a )a =0. 
As a result, 


(f+ 2)(9 + Lajze — 7G — Da; -—2ja;+p(ptla;=0 for 7 =2,3,... 


together with 
2a2 + p(p+1)ap = 0 
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and 
6a3 = 2a4 + p(p+ l)ay =0. 


We have arrived at the recursion relations 


ah 
= PEP ag 
_ _ (pt+2)(p—1) 
nn eon oe 
ae ee 
(p+j+1)(p Di for j = 2,3,.... (3:3:2:2) 


wr GF DGD 
We recognize a familiar pattern: The coefficients ap and a; are unspecified, 
so we set ag = A and a; = B. Then we may proceed to solve for the rest of 
the coefficients. Now 


+1 
a, =- 2+ Ps 
ag = 247 —1) B 
_ _(+3)(p—2)_ _ (t+3)(pt+ lp(p—2) 
— ee ae rm 
_ _(+4)(p—3) _ (p+4)(p+2)(p—1)(p—3) 
ee oe ra 5! 
__(+5)p—4) _ (+ 5) (p+ 3)(o+ Vp 2)—4) 
ial 5° 6! 
__(p+6)(p—5) _ _ (w+ 6)(p + 4) +2)(D— Dp—3)M—-5) , 
a me. 7 


and so forth. Putting these coefficient values into our supposed power series 
solution, we find that the general solution of our differential equation is 


A(t (p+1)p 2, (P+3)(P+1)plp~2) 4 


y 2 oh: 4! 
CEO NE WENGE 3) 
+3(2— CADO=D ys, BEN +O YO 9),s 
-2+ 9499+ 20- Yo“ HO-D.7,_...) 
7! , 


We assure the reader that, when p is not an integer, then these are not 
familiar elementary transcendental functions. These are what we call Legen- 
dre functions. In the special circumstance that p is a positive even integer, the 
first function (that which is multiplied by A) terminates as a polynomial. In 
the special circumstance that p is a positive odd integer, the second function 
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(that which is multiplied by B) terminates as a polynomial. These are called 
Legendre polynomials P,, and they play an important role in mathematical 
physics, representation theory, and interpolation theory. We shall encounter 
the Legendre polynomials later in the chapter. a 


It is actually possible, without much effort, to check the radius of conver- 
gence of the functions we discovered as solutions in the last example. In fact 
we use the recursion relation (3.3.1.2) to see that 
arg +207tt? 


25 -|x/? |z|? 


ee 


as j7 — oo. Thus the series expansion of the first Legendre function converges 
when |z| < 1, so the radius of convergence is 1. A similar calculation shows 
that the radius of convergence for the second Legendre function is 1. 

We now enunciate a general result about the power series solution of an 
ordinary differential equation at an ordinary point. The proof is omitted. 


Theorem 3.3.2 Let x9 be an ordinary point of the differ- 
ential equation 


y +p-y+q-y=0,7 (3.3.2.1) 


and let a and ( be arbitrary real constants. Then there 
exists a unique real analytic function y = y(x) that has a 
power series expansion about x9 and so that 


(a) The function y solves the differential equation 
(3.3.2.1). 

(b) The function y satisfies the initial conditions 
y(xo) = a, y’ (ao) = B. 


If the functions p and q have power series expansions about 
xo with radius of convergence R then so does y. 


We conclude with this remark. The examples that we have worked in de- 
tail resulted in solutions with two-term (or binary) recursion formulas: a2 was 
expressed in terms of ag and a3 was expressed in terms of a,, etc. In general, 
the recursion formulas that arise in solving an ordinary differential equation 
at an ordinary point may result in more complicated recursion relations. 


3.8. ORDINARY POINTS 147 


TT 


Exercises 


1. In each of the following problems, verify that 0 is an ordinary point. Then 
find the power series solution of the given equation. 


(a) y"+ay'+y=0 

(b) y"—y'+a2y=0 

(c) y"+2ey'-y=x 

(d) y" +y'—a*y=1 

(e) (L+2?)y"+2y'+y=0 

(f) y"+(1+a)y’-y=0 
2. Find the general solution of 


(1+27)y” + 2ry’ — 2y =0 


in terms of power series in x. Can you express this solution by means of 
elementary functions? 


3. Consider the equation y” + xy’ + y = 0. 


(a) Find its general solution y = >7, aj;x in the form y = ciyi(x) + 
coy2(x), where y1, y2 are power series. 
(b) Use the ratio test to check that the two series y; and y2 from part 
(a) converge for all « (as Theorem 3.3.2 actually asserts). 
(c) Show that one of these two solutions, say y1, is the series expansion 
of e~* /? and use this fact to find a second independent solution 
by the method discussed in Section 2.4. Convince yourself that this 
second solution is the function y2 found in part (a). 
4A. Verify that the equation y’” + y’ — zy = 0 has a three-term recursion 
formula and find its series solutions yi and y2 such that 


(a) m(0)=1, m0) =0 
(b) y2(0)=0, (0) =1 
Theorem 3.3.2 guarantees that both series converge at every x € R. 
Notice how difficult it would be to verify this assertion by working directly 
with the series themselves. 


5. The equation y” + (p+1/2—27/4)y = 0, where p is a constant, certainly 
has a series solution of the form y = > j aj’. 


(a) Show that the coefficients a; are related by the three-term recursion 
formula 


1 1 
(n+ 1)(n + 2)an+2 4 (» 5) Qn qan-2 = 0. 


(b) If the dependent variable is changed from y to w by means of y = 
wer® /4, then show that the equation is transformed into w” — aw’ + 
pw =0. 

(c) Verify that the equation in (b) has a two-term recursion formula and 


find its general solution. 
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Chebyshev’s equation is 


(1—2")y" — ay’ +p"y =0, 
where p is constant. 


(a) Find two linearly independent series solutions valid for |x| < 1. 


(b) Show that, if p =n where n is an integer > 0, then there is a polyno- 
mial solution of degree n. When these polynomials are multiplied by 
suitable constants, then they are called the Chebyshev polynomials. 
We shall see this topic again later in the book. 


Hermite’s equation is 
uM / 
y —2xcy + 2py=0, 


where p is a constant. 


(a) Show that its general solution is y(a) = a1yi(x) + a2ye2(x), where 


2p 2 2?p(p — 2) 4 


yi(z) = 1- ae + 7 ¢ 
3 
_ 2 p(p— 2)(p—4) 16 _ 
6! 
and 
2(p—1 2?(p — 1)(p—3 
3! 5! 
23 (p — — - 
PB eae coer iat) 
7! 

By Theorem 3.3.2, both series converge for all x. Verify this assertion 
directly. 


(b) If p is a nonnegative integer, then one of these series terminates and 
is thus a polynomial—y, if p is even and yp if p is odd—while the 
other remains an infinite series. Verify that for p = 0,1, 2,3,4,5, 
these polynomials are 1,a, 1 — 2a7, a — 243/3,1 — 4x? + 4a4/3, 4 — 
4x /3 + 40° /15. 

(c) It is clear that the only polynomial solutions of Hermite’s equation 
are constant multiples of the polynomials described in part (b). 
Those constant multiples which have the additional property that 
the terms containing the highest powers of x are of the form 2”2x” 
are denoted by Hn(x) and called the Hermite polynomials. Verify 
that Ho() = 1, Hi(x) = 2x, Ho(x) = 4x? — 2, H3(x) = 823 — 122, 
Ha(x) = 16a4 — 48x? + 12, and H5(x) = 32° — 160x? + 1202. 

(d) Verify that the polynomials listed in (c) are given by the general 
formula 


Use your symbol manipulation software, such as Maple or Mathematica, to 
write a routine that will find the coefficients of the power series solution, 
expanded about an ordinary point, for a given second-order ordinary 
differential equation. 
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(I 


3.4 Regular Singular Points 


Consider a second-order, linear differential equation in the standard form 
y tp-yt+q-y=0. 


Let us examine a solution about a point zo. If either of the coefficient functions 
p or q fails to be analytic at x9, then zp is not an ordinary point for the 
differential equation and the methods of the previous section do not apply. In 
this circumstance we call xo a singular point. 

There is some temptation to simply avoid the singular points. But often 
these are the points that are of the greatest physical interest. We must learn 
techniques for analyzing singular points. A simple example begins to suggest 
what the behavior near a singular point might entail. Consider the differential 
equation 


2 2 
y +e egy Ne (3.4.1) 


Obviously the point x) = 0 is a singular point for this equation. One may 
verify directly that the functions y(2) = x and y2(x) = x~? are solutions of 
this differential equation. Thus the general solution is 


y= Ac+ Br, (3.4.2) 


If we are interested only in solutions that are bounded near the origin, then we 
must take B = 0 and the solutions will have the form y = Az. Most likely, the 
important physical behavior will take place when B 4 0; we want to consider 
(3.4.2) in full generality. 

The solution of ordinary differential equations near singular points of arbi- 
trary type is extremely difficult. Many equations are intractable. Fortunately, 
many equations that arise in practice, or from physical applications, are of 
a particularly tame sort. We say that a singular point x9 for the differential 
equation 

yt+p-y+q-y=0 
is a regular singular point if 
(a—2)-p(z) and — (x ~ x9)*q(2) 
are analytic at zo. As an example, equation (3.4.1) has a regular singular point 


at 0 because 


is analytic at 0 and 
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is analytic at 0. 

Let us now consider some important differential equations from mathe- 
matical physics, just to illustrate how regular singular points arise in practice. 
Recall Legendre’s equation 


We see immediately that +1 are singular points. The point zo = 1 is a regular 
singular point because 


(@-1)-ple) = (@-1)- (- 5) = 


f= 2 c+i1 


and 


(«= 1)°a(e) = (0? ( 
are both real analytic at « = 1 (namely, we avoid division by 0). A similar 
calculation shows that zo = —1 is a regular singular point. 

As a second example, consider Bessel’s equation of order p, where p > 0 
is a constant: 


eet) Be 
1— x2 at+l1 


xy” + xy! + (2? — p*)y = 0. 


Written in the form 
1 2 2 


av 
y" +—y +? y=0, 
av av 


the equation is seen to have a singular point at x = O. But it is regular 


because 


z-p(z)=1 and 2?-q(x) =27-p? 
are both real analytic at 0. 

Let us assume for the rest of this discussion, for simplicity, that the regular 
singular point is at zo = 0. 

The key idea in solving a differential equation at a regular singular point 
is to guess a solution of the form 


y= y(a) =2™- (ag +a,z+aQz74+---). (3.4.3) 


We see that we have modified the guess used in the last section by adding a 
factor of x” in front. Here the exponent m can be positive or negative or zero— 
and m need not be an integer. In practice—and this is conceptually important 
to avoid confusion—we assume that we have factored out the greatest possible 
power of x; thus the coefficient ap will always be a nonzero constant. 

We call a series of the type (3.4.3) a Frobenius series. We now solve the 
differential equation at a regular singular point just as we did in the last 
section, except that now our recursion relations will be more complicated—as 
they will involve both the coefficients a; and also the new exponent m. The 
method is best understood by examining an example. 
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EXAMPLE 3.4.4 Use the method of Frobenius series to solve the differential 
equation 
2x? y" + 2(2e +1)y’-—y=0 (3.4.4.1) 


about the regular singular point 0. 


Solution: Writing the equation in the standard form 


we readily check that 


x(2x + 1) _ 2e+1 Ate ea) eee 
2x? 2 Qa? 2 


are both real analytic at x9 = 0. So xp = 0 is a regular singular point for this 
differential equation. 
We guess a solution of the form 


foe) CO 
=z. ajv) = a,ts 
j j 
j=0 j=0 


and therefore calculate that 


and 
co 
y" = o(mt jm + j- Yaya, 
j=0 
Notice that we do not begin the series for y’ at 1 and we do not begin the series 
for y” at 2, just because m may not be an integer so none of the summands 
may vanish (as in the case of an ordinary point). 


Plugging these calculations into the differential equation, written in the 
form (3.4.4.1), yields 


23 0(m + Amt j— Vaya 


2 (m+ ay gmtitt 4 Sones aja mi Saya <0. 
j=0 j 


We make the usual adjustments in the indices so that all powers of x are aT, 
and break off the odd terms so that all the series begin at the same place. We 
obtain 
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25 (m+ j)\(mt j - Vaya? +25 (mt j-Vajara™ 
j=l j=l 
By So(mts)aga™t -S— aut + (2n(m—1)ag2”-+ma92"—age" ) =0. 
j=l j=l 


The result is 


(20m t7)(m+j—1)a;+2(m+j—1)aj_1+(m+j)a; «;) =0 forj =1,2,3,... 
(3.4.4.2) 


together with 
[2m(m — 1) +m— lap = 0. 


It is clearly not to our advantage to let a9 = 0. Thus 
2m(m—-1)+m-1=0. 
This is the indicial equation. 
The roots of this quadratic equation are m = —1/2,1. We put each of 


these values into (3.4.4.2) and solve the two resulting recursion relations. 
Now (3.4.4.2) says that 


(2m? + 25? + 4mj — j —m—1)a; = (-2m — 25 + 2)aj_1. 


For m = —1/2 this is 


« p= 1 
ee ey cee 
We set ag = A, so that 
1 1 1 
a, = —ag = —-A 3 a2 Poet 970 34: etc. 
For m = 1 we have 
bos eed =e 
oo Bet yge th a pg te 
We set ao = B, so that 
2 2 2 4 
alia a es Ao a2 = — 701 = 2B ; etc. 


Thus we have found the linearly independent solutions 


Aw-¥?. (1-24 5a? — 4+) 


3.4. REGULAR SINGULAR POINTS 153 


and 


Be. Abs 


The general solution of our differential equation is then 


1 Doct 
Sl hg eae ia: Eira Spa Sg? eae 
y= Ax (aor: = arta Ba hea ea sea) 


There are some circumstances (such as when the indicial equation has a 
repeated root) that the method we have presented will not yield two linearly 
independent solutions. We explore these circumstances in the next section. 


a 


Exercises 


1. For each of the following differential equations, locate and classify its 
singular points on the x-axis. 
(a) 2°(%—1)y” — 2(a — 1)y’ + 3ay =0 
(b) 2?(a? —1)y” — a(1—2)y’ +2y =0 
(c) a?y” +(2—a)y' =0 
(d) (3x4 1)ry” — (a+ 1)y’ +2y =0 

2. Determine the nature of the point x = 0 (i.e., what type of singular point 
it is) for each of the following differential equations. 


(a) y”+(sinz)y=0 (d) 2x°y” +(sinx)y =0 
(b) xy” +(sinx)y =0 (e) x*y” +(sinx)y =0 
(c) a?y” +(sinz)y =0 


3. Find the indicial equation and its roots (for a series solution in powers 
of x) for each of the following differential equations. 


" 


(a) xy” + (cos 2x — 1)y’ + 2xy = 0 
(b) 4a?y” + (224 — 5a)y’ + (3a? + 2)y = 0 
(c) xy" + 3ay’ + 4ry = 0 
(d) ay” — 42Y2y’ + 32y = 0 

4. For each of the following differential equations, verify that the origin is 
a regular singular point and calculate two independent Frobenius series 
solutions: 


(a) 427y” + 3y'+y=0 (c) 2ay” + (x+1)y’ + 3y =0 
(b) 2ay”+(3-—2)y’-y=0 (d) 2x7y" + ay’ — (4 +1)y =0 
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When p = 0, Bessel’s equation becomes 


ay” +aey' +2°y=0. 


Show that its indicial equation has only one root, and use the method of 
this section to deduce that 


eS ea 


is the corresponding Frobenius series solution. 
Consider the differential equation 


” 1 tf 1 
Uo ag aay Oe 


(a) Show that x = 0 is an irregular singular point. 

(b) Use the fact that y: = x is a solution to find a second independent 
solution y2 by the method discussed earlier in the book. 

(c) Show that the second solution y2 that we found in part (b) cannot 
be expressed as a Frobenius series. 

Consider the differential equation 


” Poo qd 
ei 49 
a ge rae : 


where p and q are nonzero real numbers and b,c are positive integers. It 
is clear that, if b > 1 or c > 2, then x = 0 is an irregular singular point. 


(a) Ifb =2 and c=3, then show that there is only one possible value of 
m (from the indicial equation) for which there might exist a Frobe- 
nius series solution. 

(b) Show similarly that m satisfies a quadratic equation—and hence we 
can hope for two Frobenius series solutions, corresponding to the 
roots of this equation—if and only if b = 1 and c < 2. Observe 
that these are exactly the conditions that characterize = 0 as a 
“weak” or regular singular point as opposed to a “strong” or irregular 
singular point. 

The differential equation 


ay" + (82 -1)y’ +y=0 (x) 


has « = 0 as an irregular singular point. If 


y = 2&™(aotaiae+agx+---) 


41 2 
= av” tara” +a0™"t? 4+... 


is inserted into (x), then show that m = 0 and the corresponding Frobe- 
nius series “solution” is the power series 


y= So ale! 
j=0 


which converges only at x = 0. This demonstrates that, even when a 
Frobenius series formally satisfies such an equation, it is not necessarily 
a valid solution. 
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(I 
3.5 More on Regular Singular Points 


We now look at the Frobenius series solution of 
y’ t+p-y+q-y=0 


at a regular singular point from a theoretical point of view. 
Assuming that 0 is regular singular, we may suppose that 


co co 
=Sipjz and = a? - g(a) =) aya’, 
j=0 j=0 


valid on a nontrivial interval (—R, R). We guess a solution of the form 


and calculate 


and 7 
y= “G4+m)\G+m-—lazattm?, 


Then 


j=0 j=0 
co co 
— ee ae Soa; (m+ j)x 
j=0 j=0 
a (Som ran( ale 
j=0 \k=0 


where we have used the formula (Section 3.1) for the product of two power 
series. Now, breaking off the summands corresponding to k = 7, we find that 
this last is equal to 


lee) = 
gar? - (= Dj—kan(m + k) + poa;(m+ ») x! 


j=0 \k=0 
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Similarly, 
1 [oe} ; co 
a)-y = =| d a2") | laa? 
j=0 j=0 
Co Co 
j=0 j=0 


i 
( qj—-kak + a) x. 
j=0 \k=0 


We put the series expressions for y”, p- y’, and q- y into the differential 
equation, and cancel the common factor of «™”~?. The result is 


Yo faslin + jm+j—1) + (m+ j)po + a0] 
j=0 
k=0 


Now of course each coefficient of x? must be 0, so we obtain the following 
recursion relations: 


aj[(m + j)(m+ 7-1) + (m+ j)po + qo] 


j-l 
+ 5° agl(m + k)pj—-e + Gn] = 0 
k=0 
for 7 = 0,1,2,.... (Incidentally, this illustrates a point we made earlier, in 


Section 3.3: That recursion relations need not be binary.) 
It is convenient now to isolate the preceding formula for 7 = 0. This gives 
us the indicial equation 


f(m) = m(m — 1) + mpo + qo - 


This is an important and useful formula for the indicial equation. It is the 
equation that we solve to determine the values of m for any particular problem. 
Then the recursion relation for 7 = 0 is 


ao f(m) = 0 


(because the sum in the recursion is vacuous when j = 0). 
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The successive recursion relations are 
ay f(m+1)+ ao(mp1 + qi) = 0 


a2 f(m+ 2) + aolmp2 + go] + ai[(m+t 1)pi + qi] = 0 


aj f (mM+j)+ao[mpj+qj]+ar [(M+1)pj—-1+9qj;-1]+: » +aj-1[(M+j—1)pit+aq] =0 
etc. 


The first recursion formula tells us, since ap 4 0, that 


f(m) = m(m—1)+mpo+q =0. 


This is of course the indicial equation. The roots m1, mz of this equation are 
called the exponents of the differential equation at the regular singular point. 


Theorem 3.5.1 Suppose that xo = 0 is a regular singular 
point for the differential equation 


y"+p-y+q-y=0. 


Assume that the power series for x- p(x) and x? - q(x) have 
radius of convergence R > 0. Suppose that the indicial 
equation m(m — 1) + mpo + qo = 0 has roots m1,m2 with 
m1 < mg. Then the differential equation has at least one 
solution of the form 


co 
yy = a™ y a,x? 
j=0 


on the interval (—R, R). 
In case mz — my is not zero or a positive integer, then the 
differential equation has a second independent solution 


co 
y=a™ ) bjx? 
j=0 


on the interval (—R, R). 


Now let us explain, and put this theorem in context. If the roots mj, 
and mg are distinct and do not differ by an integer, then our procedures will 
produce two linearly independent solutions for the differential equation. If 
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my, < mg differ by an integer, say m2 = m, +k for some integer k > 1, then 
the recursion procedure breaks down because the coefficient of a; in the jth 
recursion relation for m will be 0—so that we cannot solve for a;. The case 
my, = mz also leads to difficulties—because then our methods only generate 
one solution. When m, and m2 differ by a positive integer, then it is only 
necessary to do the analysis (leading to a solution of the differential equation) 
for the exponent m,. The exponent mz is not guaranteed to lead to anything 
new. The exponent mz could lead to something new, as the next example 
shows. But we cannot depend on it. 


EXAMPLE 3.5.2 Find two independent Frobenius series solutions of 
/ 
zy +2y +a2y=0. 
Solution: We write the differential equation as 
iM 2 / 
y fae ou +1-y=0. 


Then x - p(x) = 2 and x?- q(x) = 2. Notice that the constant term po of x - p 
is 2 and the constant term qo of x? - q is 0. Thus the indicial equation is 


m(m—1)+2m+0=0. 


The exponents for the regular singular point 0 are then 0,—1. 
Corresponding to mz = 0, we guess a solution of the form 


foe) 
j=0 
which entails 
oe ie 
j=l 
and 
iad e 
u” =S°5G-Vaja??. 
j=2 


Putting these expressions into the differential equation yields 


We adjust the indices, so that all powers are x’, and break off the lower indices 
so that all sums begin at 7 = 1. The result is 


Y (wali +1)+2(7+1)]+ a1)! +2a,=0. 
j=l 
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We read off the recursion relations 


at =0 
and Ae a 
a 
a4 = - 
a GF FDGEFEDY 


for 7 > 1. Thus all the coefficients with odd index are 0 and we calculate that 


ao ao ao 


a Oe OO aa ee ae’ ea Oe 


etc. Thus we have found one Frobenius solution 


1 1 1 
Y1 => ao (1 Tal t All ae f+: +) 
1 x? a x? “i 
— — x — 
a0 x 3! 5! 
Dy oak 
= ag:—-sing. 
x 
Corresponding to m; = —1 we guess a solution of the form 
y=ax Seb git 
j=0 j=0 


and 


Putting these calculations into the differential equation gives 


SOU = DG — 2)bja? + S29 — dja? + SO ja? =O. 
j=0 j=0 j=0 
We adjust the indices so that all powers that occur are x/~? and break off 


extra terms so that all sums begin at 7 = 2. The result is 


SiIG — 1)G — 2)d; + 2G — 1)dj + by-2] 07? 


+ (29600 + (0)(—1)b1a7? + 2(—1)bpa-? + 2(0)010~*) = 6. 
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The four terms at the end cancel out. Thus our recursion relation is 


b;_2 
pea 
7 jG-1) 
for 7 > 2. 
It is now easy to calculate that 
bo bo bo 
Gi Bie it ct, 
Pi St? eet A Gee e 
Also 
by by by 
ne pe ee ee Tt 
Te ee adel 2s Toe 
This gives the solution 
1 oes i 1 
eae aes (1-5 ae —+ Jon = (= 52° tae =e -) 
1 
= bo: —-cosx+6,:—-sinax 
x 


Now we already discovered the solution (1/2) sin x in our first calculation 
with mz = 0. Our second calculation, with m, = —1, reproduces that solution 
and discovers the new, linearly independent solution (1/2) cos x. a 


EXAMPLE 3.5.3 The equation 
Ag? y" — 827 y' + (4a? + 1)y =0 
has only one Frobenius series solution. Find the general solution. 


Solution: The indicial equation is 
1 
m(m—1)+m-0+7=0. 


The roots are m, = 1/2,m2 = 1/2. This is a repeated indicial root, and it 
will lead to complications. 
We guess a solution of the form 


Thus 


e= >) GHIDae 
g=0 
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and 
Co 


y! = SOG +1/2)g- 1/2)a;x5~3/? 
j=0 


Putting these calculations into the differential equation yields 
de? - S°(j +:1/2)(j —1/2)a?9/? 
j=0 


82? \°G + 1/2)ajetV? + (40 +1) - Saja? = 0. 
j=0 j=0 


We may rewrite this as 
S529 + 1)(29 — Lajat 1? 


~ $7 (85 + Aayatt9/? 4S" dajat8? +S ayait¥/?, 


j=0 j=0 j=0 


We adjust the indices so that all powers of x are x/+!/? and put extra 
terms on the right so that all sums begin at 7 = 2. The result is 


S- (2s + 1)(27 — 1)a; — (87 — 4)aj_1 + 4aj_-2 + aj) tt” 
j=2 
= -l: (—1)ag2'/? —3-1-a,x°/? + dagx®/? — agx'/? — ayx?/? 
or ie 
S > [43705 => (87 = A)aj—1 + 4aj—9] gitt/2 = x3/? (dag — 4ay) . 
j=2 
We thus discover the recursion relations 
a1 = ao 
27 — 1)a;_1 — a;_ 
a= Ee Ts for j>2. 
J 
Therefore 
_% | 9% | _% | _% 4 
Jape ees Bh ape Re on 


We thus have found the Frobenius series solution to our differential equation 
given by 


= ee ' 
inte) = 0"? (1 Cap tore er a wi) ante 
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But now we are stuck because m = 1/2 is a repeated root of the indicial 
equation. There is no other. We can rescue the situation by thinking back to 
how we handled matters for second-order linear equations with constant coef- 
ficients. In that circumstance, if the associated polynomial had distinct roots 
r1,r2 then the differential equation had solutions yy = e"!* and yo = e”?”. 
But if the associated polynomial had repeated roots r and r, then solutions of 
the differential equation were given by y; = e”* and y2 = x-e"”. Reasoning by 
analogy, and considering that x (or its powers) is a logarithm of e*, we might 
hope that when m is a repeated root of the indicial equation, and y; = yi(2) 
is a solution corresponding to the root m, then y2(x) = Ina: yi (x) is a second 
solution. We in fact invite the reader to verify that 


ee 


yo(z) =Ina-a’/*-e 


is a second, linearly independent solution of the differential equation. So its 
general solution is 


y = Ar? .e? + Blna-r/? - e”. | 


lO 


Exercises 
1. The equation 
ay” — 3ay’ + (4c + 4)y = 0 
has only one Frobenius series solution. Find it. 


2. The equation 
Any” — 827 y' + (42? + 1)y =0 


has only one Frobenius series solution. Find the general solution. 


3. Find two independent Frobenius series solutions of each of the following 
equations. 
(a) ry” + 2y'+2y=0 
(b) ay” — 2%y' +(x” — 2)y =0 
(c) ry” —y'+4a°y =0 
4. Verify that the point 1 is a regular singular point for the equation 


(x — 1)?y" — 3(a — 1)y’ + 2y =0. 


Now use Frobenius’s method to solve the equation. 


5. Verify that the point —1 is a regular singular point for the equation 
3(x + 1)?y” — (w@ + 1)y’ —y =0. 


Now use Frobenius’s method to solve the equation. 
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6. Bessel’s equation of order p = 1 is 


20 


ay” +ay' +(x? -1)y =0. 


We see that zp = 0 is a regular singular point. Show that m1; — m2 = 2 
and that the equation has only one Frobenius series solution. Then find 
it. 

7. Bessel’s equation of order p = 1/2 is 


20 | +(2 +) 
wy +a2y +(2 qua 


Show that mz —m, = 1, but that the equation still has two independent 
Frobenius series solutions. Then find them. 


(I 
Historical Note 


Gauss 


Often called the “prince of mathematicians,” Carl Friedrich Gauss (1777— 
1855) left a legacy of mathematical genius that exerts considerable influence 
even today. 

Gauss was born in the city of Brunswick in northern Germany. He showed 
an early aptitude with arithmetic and numbers. He was finding errors in his 
father’s bookkeeping at the age of 3, and his facility with calculations soon 
became well known. He came to the attention of the Duke of Brunswick, who 
supported the young man’s further education. 

Gauss attended the Caroline College in Brunswick (1792-1795), where 
he completed his study of the classical languages and explored the works of 
Newton, Euler, and Lagrange. Early in this period he discovered the prime 
number theorem—legend has it by staring for hours at tables of primes. Gauss 
did not prove the theorem (it was finally proved in 1896 by Hadamard and 
de la Vallee Poussin). He also, at this time, invented the method of least 
squares for minimizing errors—a technique that is still widely used today. He 
also conceived the Gaussian (or normal) law of distribution in the theory of 
probability. 

At the university, Gauss was at first attracted by philology and put off 
by the mathematics courses. But at the age of eighteen he made a remark- 
able discovery—of which regular polygons can be constructed by ruler and 
compass—and that set his future for certain. During these years Gauss was 
flooded, indeed nearly overwhelmed, by mathematical ideas. In 1795, just as 
an instance, Gauss discovered the fundamental law of quadratic reciprocity. It 
took a year of concerted effort for him to prove it. It is the core of his celebrated 
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treatise Disquisitiones Arithmeticae, published in 1801. That book is arguably 
the cornerstone of modern number theory, as it contains the fundamental 
theorem of arithmetic as well as foundational ideas on congruences, forms, 
residues, and the theory of cyclotomic equations. The hallmark of Gauss’s 
Disquisitiones is a strict adherence to rigor (unlike much of the mathematics 
of Gauss’s day) and a chilling formality and lack of motivation. 

Gauss’s work touched all parts of mathematics—not just number theory. 
He discovered what is now known as the Cauchy integral formula, developed 
the intrinsic geometry of surfaces, discovered the mean-value property for 
harmonic functions, proved a version of Stokes’s theorem, developed the theory 
of elliptic functions, and he anticipated Bolyai and Lobachevsky’s ideas on 
non-Euclidean geometry. With regard to the latter—which was really an earth- 
shaking breakthrough—Gauss said that he did not publish his ideas because 
nobody (i.e., none of the mathematicians of the time) would appreciate or 
understand them. 

Beginning in the 1830s, Gauss was increasingly occupied by physics. He 
had already had a real coup in helping astronomers to locate the planet Ceres 
using strictly mathematical reasoning. Now he turned his attention to conser- 
vation of energy, the calculus of variations, optics, geomagnetism, and poten- 
tial theory. 

Carl Friedrich Gauss was an extraordinarily powerful and imaginative 
mathematician who made fundamental contributions to all parts of the sub- 
ject. He had a long and productive scientific career. When, one day, a messen- 
ger came to him with the news that his wife was dying, Gauss said, “Tell her 
to wait a bit until Iam done with this theorem.” Such is the life of a master 
of mathematics. 


a 


Historical Note 
Abel 


Niels Henrik Abel (1802-1829) was one of the foremost mathematicians of 
the nineteenth century, and perhaps the greatest genius ever produced by the 
Scandinavian countries. Along with his contemporaries Gauss and Cauchy, 
Abel helped to lay the foundations for the modern mathematical method. 
Abel’s genius was recognized when he was still young. In spite of grinding 
poverty, he managed to attend the University of Oslo. When only 21 years 
old, Abel produced a proof that the fifth-degree polynomial cannot be solved 
by an elementary formula (involving only arithmetic operations and radicals). 
Recall that the quadratic equation can be solved by the quadratic formula, and 
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cubic and quartic equations can be solved by similar but more complicated 
formulas. This was an age-old problem, and Abel’s solution was a personal 
triumph. He published the proof in a small pamphlet at his own expense. This 
was typical of the poor luck and lack of recognition that plagued Abel’s short 
life. 

Abel desired to spend time on the Continent and commune with the great 
mathematicians of the day. He finally got a government fellowship. His first 
year was spent in Berlin, where he became a friend and colleague of August 
Leopold Crelle. He helped Crelle to found the famous Journal ftir die Reine 
und Angewandte Mathematik, now the oldest extant mathematics journal. 

There are many parts of modern mathematics that bear Abel’s name. 
These include Abel’s integral equation, Abelian integrals and functions, 
Abelian groups, Abel’s series, Abel’s partial summation formula, Abel’s limit 
theorem, and Abel summability. The basic theory of elliptic functions is due 
to Abel (the reader may recall that these played a decisive role in Andrew 
Wiles’s solution of Fermat’s Last Problem). 

Like Riemann (discussed elsewhere in this book), Abel lived in penury. He 
never held a proper academic position (although, shortly before Abel’s death, 
Crelle finally secured for him a professorship in Berlin). The young genius 
contracted tuberculosis at the age of 26 and died soon thereafter. 

Crelle eulogized Abel in his Journal with these words: 


All of Abel’s works carry the imprint of an ingenuity and force of thought 
which is amazing. One may say that he was able to penetrate all obstacles 
down to the very foundation of the problem, with a force which appeared 
irresistible ... He distinguished himself equally by the purity and nobility 
of his character and by a rare modesty which made his person cherished 
to the same unusual degree as was his genius. 


It is difficult to imagine what Abel might have accomplished had he lived a 
normal lifespan and had an academic position with adequate financial support. 
His was one of the great minds of mathematics. 

Today one may see a statue of Abel in the garden of the Norwegian King’s 
palace—he is depicted stamping out the serpents of ignorance. Also the Abel 
Prize, one of the most distinguished of mathematical encomia, is named for 
this great scientist. 


Lr 


3.6 Steady-State Temperature in a Ball 


Let us show how to reduce the analysis of the heat equation in a three- 
dimensional example to the study of Legendre’s equation. Imagine a solid 
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ball of radius 1. We shall work in spherical coordinates (r,6,¢) on this 
ball. We hypothesize that the surface temperature at the boundary is held 
at g(0) = Tysin’@. Of course the steady-state temperature T will satisfy 
Laplace’s equation 


The Laplace operator is rotationally invariant and the boundary condition 
does not depend on the azimuthal angle ¢, hence the solution will not depend 
on ¢ either. One may calculate that Laplace’s equation thus takes the form 


1a / oT 1 8/. oT 


We seek a solution by the method of separation of variables. Thus we set 
T(r, 0) = A(r) - B(O). Substituting into (3.6.1) gives 


p42 (44), A 4 (dB) _ 
te ep ae Ge de 


Thus we find that 


ld (dA eer ae iB 

Aa (~ =) =~ Band ao (sin a) : 
The left-hand side depends only on r and the right-hand side depends only 
on @. We conclude that both sides are equal to a constant. Looking ahead to 
the use of the Legendre equation, we are going to suppose that the constant 
has the form c = n(n + 1) for n a nonnegative integer. This rather surprising 
hypothesis will be justified later. Also refer back to the discussion in Example 


3.3.1. 
Now we have this ordinary differential equation for B: 


1 1 d/. dB 
a ane de (sno) +n(n+1) =0. 


We make the change of variable 
v=cosé, y(v) = B(6). 
With the standard identities 


d d0d _ 1 ad 
dv dvdO ~~ sin@d@’ 


sin?6=1—v? and 


we find our differential equation converted to 


< (a - 7) +n(n+1)y=0. 
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This is equivalent to 


(1 yyy!" Qvy' +n(n+1)y=0. (3.6.2) 


This is Legendre’s equation. 

Observe that v = +1 corresponds to 6 = 0,7, i.e., the poles of the sphere. 
A physically meaningful solution will certainly be finite at these points. We 
conclude that the solution of our differential equation (3.6.2) is the Legendre 
polynomial y(v) = P,,(v). Therefore the solution to our original problem is 
B,(0@) = Py, (cos 6). 

Our next task is to solve for A(r). The differential equation (resulting 
from our separation of variables) is 
= (eS) =n(n+1)A. 


dr 


One can use the method of power series to solve this equation. Or one can 
take a shortcut and simply guess that the solution will be a power of r. Using 
the latter method, we find that 


An(r) = car™ + dar. 


Again, physical considerations can guide our thinking. We know that the tem- 
perature must be finite at the center of the sphere. Thus d, must equal 0. Thus 
An(r) = cnr”. Putting this information together with our solution in 0, we 
find the solution of Laplace’s equation to be 


T = cnr” Py, (cos). 


Here, of course, cp, is an arbitrary real constant. 
Now we invoke the familiar idea of taking a linear combination of the 
solutions we have found to produce more solutions. We write our general 


solution as 
T= 3 Cnr” Pr (cos 6) . 


Recall that we specified the oar temperature on the sphere (the boundary 
of the ball) to be T = Ty sin’ @ when r = 1. Thus we know that 


To sin? 6 = -> Cn Pn (cos 8) . 


It is then possible to use the theory of Fourier-Legendre expansions to solve 
for the c,. Since we have not developed that theory in the present book, we 
shall not carry out these calculations. We merely record the fact that the 
solution turns out to be 


8 16 8 
T=Tp (Fra(oos 0) — a" P2(cos 0) + 357 Palcos 6) : 
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Problems for Review and Discovery 


A. Drill Exercises 


1. 


Use the method of power series to find a solution of each of these differ- 
ential equations. 

(a) y” +2ay = 2? 

(b) y"—ay'+y=a 

(c) y+y'+y=2%-2 

(d) 2y" +ay'+y=0 

(e) (44+27)y"—y' +y=0 

(f) (a? +1)y”—ay’+y=0 

(g) y"—-(@+1y'—azy=0 

(h) (wy +(e@+)y'+y=0 

For each of the following differential equations, verify that the origin is 
a regular singular point and calculate two independent Frobenius series 
solutions. 

(a) (a? 4+1)2?y" — ay’ + (2+2)y=0 

(b) 2?y" +ay'+(1+2)y=0 

(c) ay” —4y'+2y =0 

(d) 4a7y” + 4a7y' + 2y = 0 

(e) 2ry”+(1—a)y’+y=0 

(f) xy” —(e—1)y’+2y=0 

(g) ay" +2(1—a)y'+y=0 

(h) ay” +(x+ ly’ +ty=0 

In each of these problems, use the method of Frobenius to find the first 


four nonzero terms in the series solution about x = 0 for a solution to 
the given differential equation. 


(a) ay!" 4 Qa? y"" (x x)y’ ry = 0 
(b) ay!" +.27y" — 3xy! + (a — ly =0 
(c) ay!" Qa? y"" (a? 2n)y" xy = 0 
(d) x®y!” + (22° — 2*)y” — ay! +y = 0 


B. Challenge Problems 


1. 


For some applications it is useful to have a series expansion about the 
point at infinity. In order to study such an expansion, we institute the 
change of variables z = 1/x (of course we must remember to use the 
chain rule to transform the derivative as well) and expand about z = 0. 
In each of the following problems, use this idea to verify that oo is a 
regular singular point for the given differential equation by checking that 
z = 0 is a regular singular point for the transformed equation. Then find 
the first four nonzero terms in the series expansion about oo of a solution 
to the original differential equation. 
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(a) ey" ie xy’ +y= 0 
(b) 9(a — 2)?(a — 3)y” + 6a(a — 2)y’ + 16y = 0 
(c) (1—2?)y” — 2ry'+ p(p+1)y=0  (Legendre’s equation) 


(d) 2?y"” + ay’ + (2? —p?)y=0  (Bessel’s equation) 
2. Laguerre’s equation is 


cy" +(1—2)y'+py =0, 
where p is a constant. Show that the only solutions that are bounded 


near the origin are series of the form 


soap t 1)--- Ln 1 ee 
1450 (=P ote P ) 


=1 


This is the series representation of a confluent hypergeometric function, 
and is often denoted by the symbol F'(—p, 1, x). In case p > 0 is an integer, 
then show that this solution is in fact a polynomial. These solutions are 
called Laguerre polynomials, and they play an important role in the study 
of the quantum mechanics of the hydrogen atom. 


3. The ordinary differential equation 


d?y 
4 2 
a—+dVy=0, «>0 
dx? eey 

is the mathematical model for the buckling of a column in the shape of 
a truncated cone. The positive constant depends on the rigidity of the 
column, the moment of inertia at the top of the column, and the load. 
Use the substitution « = 1/z to reduce this differential equation to the 
form 


Find the first five terms in the series expansion about the origin of a 
solution to this new equation. Convert it back to an expansion for the 
solution of the original equation. 


C. Problems for Discussion and Exploration 


1. Consider a nonlinear ordinary differential equation such as 
[sin y]y” + e4y’ —y? =0. 


Why would it be neither efficient nor useful to guess a power series solu- 
tion for this equation? 


2. Acelebrated theorem of Cauchy and Kowalewska guarantees that a non- 
singular ordinary differential equation with real analytic coefficients will 
have a real analytic solution. What will happen if you seek a real analytic 
(i.e., a power series) solution to a differential equation that does not have 
real analytic coefficients? 
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3. Show that if y is a solution of Bessel’s equation 


ay" + ay’ + (2° —p*)y =0 
of order p, then u(x) = x~°y(ax”) is a solution of 
vu" + (2¢+ lau’ + [a?b?x?? + (c? — p?b’)u =0. 


Use this result to show that the general solution of Airy’s equation y” — 


zy = 0 is 
Qa |3/? Q\a|3/? 
y =a? (Adj (A) + BJ_1/3 (5)) 


Here J, is the Bessel function defined by 


ite) = 3)" gee G)” 


k=0 


as long as n > 0 is an integer. In case n is replaced by p not an integer, 
then we replace (k + n)! by [(k + p+ 1), where 


r= f ate * da. 
0 


A 


Sturm—Liouville Problems and Boundary 
Value Problems 


e The concept of a Sturm—Liouville problem 
e How to solve a Sturm-—Liouville problem 
e Eigenvalues and eigenfunctions 


e Orthogonal expansions 


Singular Sturm—Liouville 


Separation of variables 


4.1 What Is a Sturm—Liouville Problem? 


We wish to introduce the idea of eigenvalues and eigenfunctions. We can mo- 
tivate the idea with the fairly extensive and far-reaching subject of Sturm— 
Liouville problems. 

A sequence y; of functions such that 


b 
i Ym(2)Yyn(x) dx = 0 for m#n 


is said to be an orthogonal system on the interval |a, b]. If 


b 
/ y; (x) dz =1 
a 
for each j then we call this an orthonormal system or orthonormal sequence. 
It turns out (see below) that the sequence of eigenfunctions associated with a 


wide variety of boundary value problems enjoys the orthogonality property. 
Now consider a differential equation of the form 


< (oa) + [Aq(x) + r(x)]y = 0; (4.1.1) 
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we shall be interested in solutions valid on a bounded interval [a, b]. We know 
that, under suitable conditions on the coefficients, a solution of equation 
(4.1.1) that takes a prescribed value and a prescribed derivative value at a 
fixed point 2x9 € [a, b] will be uniquely determined. In other circumstances, we 
may wish to prescribe the values of y at two distinct points, say at a and at b. 
We now begin to examine the conditions under which such a boundary value 
problem has a nontrivial solution. 


EXAMPLE 4.1.2 Consider equation (4.1.1) with p(#) = q(x) = 1 and r(x) = 0. 
Then the differential equation becomes 


y’ +Ay=0. (4.1.2.1) 
We take the domain interval to be [0,7] and the boundary conditions to be 
y(0)=0, y(m) =0. (4.1.2.2) 
Let us determine the eigenvalues and eigenfunctions for this problem. 


Solution: The situation with boundary conditions is quite different from that 
for initial conditions. The latter is a sophisticated variation of the fundamental 
theorem of calculus. The former is rather more subtle. So let us begin to 
analyze. 

First, if \ < 0 then the solutions of the differential equation are exponen- 
tials. So no nontrivial linear combination of these can satisfy the boundary 
conditions (4.1.2.2). 

If \ = 0 then the general solution of (4.1.2.1) is the linear function y = 
Ax + B. Such a function cannot vanish at two points unless it is identically 
zero. 

So the only interesting case is A > 0. In this situation, the general solution 
of (4.1.2.1) is 

y= Asin Vir + Boos V Xx. 


Since y(0) = 0, this in fact reduces to 
y = AsinV Ax. 


In order for y(7) = 0, we must have Am = nz for some positive integer n, 

thus \ = n?. These values of are termed the eigenvalues of the problem, and 

the corresponding solutions 
sing, sin2x, sin3z... 


are called the eigenfunctions of the problem (4.1.2.1), (4.1.2.2). a 


We note these immediate properties of the eigenvalues and eigenfunctions 
for our problem: 
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(i) If ¢ is an eigenfunction for eigenvalue A, then so is c- ¢ for any 
constant c. 
(ii) The eigenvalues 1,4,9,... form an increasing sequence that ap- 
proaches +co. 
(iii) The nth eigenfunction sin nz vanishes at the endpoints 0,7 (as we 


originally mandated) and has exactly n — 1 zeros in the interval 
(0,7). 


It will turn out—and this is the basis for the Sturm—Liouville theory— 
that, if p,q > 0 on [a,b], then equation (4.1.1) will have a solvable boundary 
value problem—for a certain discrete set of values of A—with data specified 
at points a and b. These special values of » will of course be the eigenvalues 
for the boundary value problem. They are real numbers that we shall arrange 
in their natural order 


Ay < AQ <0 << An <eee, 


and we shall learn that A; — +oo. The corresponding eigenfunctions will then 
be ordered as y1, ya,-.-- 

Now let us examine possible orthogonality properties for the eigenfunc- 
tions of the boundary value problem for equation (4.1.1). Consider the differ- 
ential equation (4.1.1) with two different eigenvalues \,, and A, and y» and 
Yn the corresponding eigenfunctions: 


& (162) SH) + (rmale) + r(e)l¥m = 0 


and 
& (vio) + nate) +rle)lyn =0. 


We convert to the more convenient prime notation for derivatives, multiply 
the first equation by yn, and the second by ym, and subtract. The result is 


Yn(PYin) — Ym(PYn)’ + (Am — An)WYmYn = 0. 


We move the first two terms to the right-hand side of the equation and 
integrate from a to b. Hence 


b 


b b 
(Am = An) f Vite = J vlog)! dx — | Yn(PYm) dx 


a 


b 
(parts) b 
PS [ym(pyh) |, - / Yin (PY) dx 


—[yn(oyln) |, + +f Yr (PY) dex 


= P(b)[Ym(b)¥n (0) — Yn (0)Ym (0) 
—P(4)[Ym(@)¥%n(@) = Yn(@) Ym (a)] - 
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Notice that the two integrals on the right have cancelled. 
Let us denote by W(a) the Wronskian determinant of the two solutions 
Ym; Yn. Thus 
W (ax) = Ym() Yn (#) = Yn (2) Yn (@) - 


Then our last equation can be written in the more compact form 


b 
(Am =n) faut dz = p(B) (0) ~ pla) (a). 


Notice that things have turned out so nicely, and certain terms have cancelled, 
just because of the special form of the original differential equation. 

We want the right-hand side of this last equation to vanish. This will 
certainly be the case if we require the familiar boundary condition 


y(a) =0 and y(b) =0 
or instead we require that 
y(a)=0 and y/(b)=0. 
Either of these will guarantee that the Wronskian vanishes, and therefore 
b 
/ Ym *Yn-qdx =0. 
This is called an orthogonality condition with weight q. 


With such a condition in place, we can consider representing an arbitrary 
function f as a linear combination of the y;: 


f(x) = aryi(x) + agye(x) +++ + ajy;(x) +-°-. (4.1.3) 


We may determine the coefficients a; by multiplying both sides of this equation 
by yx: gq and integrating from a to b. Thus 


[ f(2)yn(x)q(a) dx = i. (anto stan (any ese 
+ajyj(x) ++: +) me(a)a(e his 


b 
= a) Yj (x) yn (x) q(x) dx 


I 


b 
anf yil2)ala) ae 
since all but one of the integrals vanishes (by orthogonality). Thus 


_ Ja F(a)yn(a)ala) de 


MOT OL, 
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There is an important question that now must be asked. Namely, are 
there enough of the eigenfunctions y; so that virtually any function f can 
be expanded as in (4.1.3)? For instance, the functions y;(#) = sin x, y3(a) = 
sin 32, y7(a) = sin 7x are orthogonal on [—7,7], and for any function f one 
can calculate coefficients a ,a3,a7. But there is no hope that a large class of 
functions f can be spanned by just y1, y3, yz. We need to know that our y;’s 
“fill out the space.” The study of this question is beyond the scope of the 
present text, as it involves ideas from Hilbert space (see [RUD1], [RUD2]). 
Our intention here has been merely to acquaint the reader with some of the 
language of Sturm—Liouville problems. 


Math Nugget 


Charles Hermite (1822-1901) was one of the most eminent 
French mathematicians of the nineteenth century. He was 
particularly noted for the elegance, indeed the artistry, of 
his work. As a student, he courted disaster by neglecting his 
routine assignments in order to study the classic masters of 
mathematics. Although he nearly failed his examinations, 
he became a first-rate and highly creative mathematician 
while still in his early twenties. In 1870 he was appointed to 
a professorship at the Sorbonne, where he trained a whole 
generation of important French mathematicians; these in- 
cluded Picard, Borel, and Poincaré. 

The unusual character of Hermite’s mind is suggested by 
the following remark of Poincaré: “Talk with M. Hermite. 
He never evokes a concrete image, yet you soon perceive 
that the most abstract entities are to him like living crea- 
tures.” He disliked geometry, but was strongly attracted to 
number theory and analysis; his favorite subject was elliptic 
functions, where these two subjects interact in remarkable 
ways. 

Several of Hermite’s purely mathematical discoveries 
had unexpected applications many years later to mathe- 
matical physics. For example, the Hermite forms and ma- 
trices, which he invented in connection with certain prob- 
lems of number theory, turned out to be crucial for Heisen- 
berg’s 1925 formulation of quantum mechanics. Also Her- 
mite polynomials and Hermite functions are useful in solv- 
ing Schrédinger’s wave equation. 
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a 


Exercises 


1. The differential equation 


P(a)y" + Q(x)y' + R(x)y = 0 


is called exact if it can be written in the form [P(x)y’]’ + [S(x)y]’ = 0 
for some function S(x). If such an equation is not exact, it can often 
be made exact by multiplying through by a suitable integrating factor 
u(x) (actually, there is always an integrating factor—but you may have 
trouble finding it). The function s(x) must satisfy the condition that the 
equation 


u(x) P(x)y" + n(x)Q(x)y’ + w(x) R(x)y = 0 
can be expressed in the form 
[(x) P(x)y'] 


for some appropriate function S. Such an equation can be solved by the 
method of first-order linear equations. 


/ 


+ [S(e)yJ =0 


Show that this w must be the solution of the adjoint equation 
P(a)u" (x) + [2P"(x) — Q(x)] u'(a) + [P"(z) — Q(z) + R(a)] w(x) = 0. 


Often the adjoint equation is just as difficult to solve as the original 
differential equation. But not always. Find the adjoint equation in each 
of the following instances. 


(a) Legendre’s equation: (1 —2?)y” — 2zy’ + p(p+1)y =0 


(b) Bessel’s equation: x?y” + xy’ + (2? — p?)y =0 
(c) Hermite’s equation: y” — 2ry’ + 2py = 0 


(d) Laguerre’s equation: zy” + (1—2)y' + py =0 
2. Refer to Exercise 1. Consider the Euler equidimensional equation, 
xy” + xy! = ny = 0, 


which we have seen before. Here n is a positive integer. Find the values 
of n for which this equation is exact, and for these values find the general 
solution by the method suggested in Exercise 1. 


3. Refer to Exercise 1 for terminology. Solve the equation 


y" (242) y 4y =0 
x 


by finding a simple solution of the adjoint equation by inspection. 


4. Refer to Exercise 1. Show that the adjoint of the adjoint of the equation 
P(x)y” + Q(a)y’ + R(x)y = 0 is just the original equation. 
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5. Refer to Exercise 1 for terminology. The equation P(x)y” + Q(x)y’ + 
R(x)y = 0 is called self-adjoint if its adjoint is just the same equation 
(after a possible change of notation). 

(a) Show that this equation is self-adjoint if and only if P’(x) = Q(z). 
In this case the equation becomes 
P(a)y" + P'(x)y' + R(x) =0 
or 
[P(x)y') + R(x)y =0. 
This is the standard form for a self-adjoint equation. 
(b) Which of the equations in Exercise 1 are self-adjoint? 


6. Show that any equation P(x)y” + Q(x)y’ + R(x)y = 0 can be made 
self-adjoint by multiplying through by 


1 ofQ/P) de 
P 


7. Using Exercise 5 when appropriate, put each equation in Exercise 1 into 
the standard self-adjoint form described in Exercise 6. 


a 
4.2 Analyzing a Sturm—Liouville Problem 
Let us now formulate and analyze some Sturm—Liouville problems. 


EXAMPLE 4.2.1 For fixed n, the Bessel equation 


2 
e+ (ee “)y=0, a<r<b, (4.2.1.1) 


ax 


is a Sturm—Liouville problem. Here p = x, q = x, \ = k?, and r = n?/x. We 
impose the endpoint conditions y(a) = 0 and y(b) = 0. a 


EXAMPLE 4.2.2 Consider the differential equation 
y’ +rAy=0, -at#<aK<r. 
We impose the period endpoint conditions 
y(—r) = (x) 
y(—m) -=- y(n). 
It is straightforward to calculate that the eigenfunctions for this system are 1, 
cosnx, and sinnx for n a positive integer. The corresponding eigenvalues are 


n?. Note that, for n > 0, there are two distinct eigenfunctions with the same 
eigenvalue n?. But, for n = 0, there is only one eigenfunction. |_| 
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EXAMPLE 4.2.3 The Mathieu equation is 
y’ + (A+ 16dcos2x)y=0, O<aK<nr. 


Notice that p = 1, gq = 1, and r = —16dcos2z. Of course d is a constant 
and A is a constant. We can impose the endpoint conditions y(0) = y(z), 


y'(0) = y'(m) or y(0) = —y(7), y'(0) = —y"(z). 


An important feature of the eigenfunctions of a Sturm—Liouville problem 
is their “orthogonality.” This idea we now explain. 


Proposition 4.2.4 Eigenfunctions u and v having different 
eigenvalues for a Sturm—Liouville problem as in equation 
(4.1.1) are orthogonal with weight q in the sense that 


b 
/ [u(x)v(x)] q(x) dx =0. (4.2.4.1) 


Proof: Suppose that u is an eigenfunction corresponding to eigenvalue A and 
v is an eigenfunction corresponding to eigenvalue yw. And assume that A 4 wu. 
Let us use the operator notation 


Ly] = [p(x)y'V + r(z)y- 


Observe that saying that u is an eigenfunction with eigenvalue 2 is just the 
same as saying that 
Llu] = —Aqu (4.2.4.2) 


and likewise 
Liv] = —pqu. (4.2.4.3) 


Then we know that 
Llu] + dg(a)u = L[v] + pq(a)v = 0. 
Now observe, by direct calculation, that 


d 


uL{e] — wbfu) = 4 wa)luCeye'(e) — vleyu'e] 


We integrate this identity from a to b, using the equalities in (4.2.4.2) and 
(4.2.4.3). The result is 


b 
(A— #) / q(x)u(x)o(a) da = p(x) [u(x)v" (x) — v(w)u'(x)] 
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If we now use our endpoint conditions au(a) + a’u'(a) = 0 and Gu(b) + 
B'u'(b) = 0, we may see that 


p(a)[u(a)u'(a) — v(a)u'(a)] = [ap(a)/a’][u(a)v(a) — v(a)u(a)] = 0 


provided that a’ 4 0. If a 0 then the right-hand side of (4.2.4.4) similarly 
reduces at x = a to 


[o"p(a)/a][u'(a)v'(a) — v'(a)u'(a)] = 0. 


As a result, 
p(a)[u(a)u'(a) — v(a)u'(a)] = 0 
unless a = a’ = 0. Similar calculations apply to the endpoint x = b. Since 
we do not allow both a = a’ = 0 and 2 = f’ = 0, we find therefore that 
the right-hand side of (4.2.4.4) vanishes. Formula (4.2.4.1) now follows from 
formula (4.2.4.4) after dividing through by the nonzero factor » — p. oO 


In asense, Sturm—Liouville theory is a generalization of the Fourier theory 
that we shall learn in Chapter 6. One of the key ideas in the Fourier theory is 
that most any function can be expanded in a series of sine and cosine functions. 
And of course these two families of functions are the eigenfunctions for the 
problem y’’ + Ay = 0. The eigenvalues (values of X) that arise form an infinite 
increasing sequence, and they tend to infinity. And only one eigenfunction 
corresponds to each eigenvalue. We shall learn now that these same phenomena 
take place for a Sturm—Liouville problem. 

Now suppose that, for a given Sturm—Liouville problem on an interval 
(a, b], the eigenvalues are A; and the corresponding eigenfunctions are y;. Our 
goal is to take a fairly arbitrary function f defined on the interval of definition 
[a,b] and write it as 


= 2 ayy; (2) . (4.2.5) 


How can we determine the coefficients a,;? 
Fix an index n. We multiply both sides of the equation (4.2.5) by yn (x)q(x) 
and integrate from a to b. The result is 


[fs uae Jaca) ae = Laue) i aa ae. 


We switch the summation and integration operations on the right-hand side 
(a move that is justified by an advanced theorem from real analysis—see 
[KRA2]). So we now have 


| “Heitaek ade = [ ajyj(2t) ya (ata(e) de. 
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Now the orthogonality of the eigenfunctions y; tells us that each summand 
on the right vanishes except for the term with 7 = n. So we have 


b 
[ Fle un(e)a(e) de = J anun(e)yn(a)ala) de. 
Thus we have learned that 


J. f(x) yn(«)q(a) dee 
me eae a g(x) dx 


The following theorem summarizes the key facts about the series expan- 
sion (4.2.5) and the formula (4.2.6) for calculating the coefficients. 


(4.2.6) 


Theorem 4.2.7 Suppose that yi, y2, ... are the eigenfunc- 
tions of a regular Sturm—Liouville problem posed on the 
interval [a,b]. This simply means that the interval is finite 
and that p and q are positive and continuous on the en- 
tire interval. Suppose that f is a continuous function on the 
interval [a, 6]. Then the series 


») = Yoana 


with the coefficients a; calculated according to the formula 


=i@) r)y;(x)q(x) dx 
J2(yj(a))2q(2) dx 


converges at each point x of the open interval (a,b) to the 
value f(x). 


EXAMPLE 4.2.8 Consider the Sturm-—Liouville problem 
y+ rA(yY =0 


on the interval [0,Z] with boundary conditions y'(0) = y(L) = 0. It has 
eigenfunctions 
(29 — 1)ra 


2L 


(we leave this calculation as an exercise for you). Notice that, for this Sturm-— 
Liouville problem, g(x) = 1 and r(x) = 0. 


y; (x) = cos 
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The corresponding eigenfunction series expansion for a continuous func- 
tion f is 
co ° 
(27 — 1)ra 
a y dj COB 5 5 
j=l 


where 
ie ) cos[(27 — 1)7a/(2L)] dx 


o 7 any j—1)ma/(2L)| de 


But a simple calculation shows that 
L L 
7 cos?[(27 — 1)ra/(2L)| dx = 5° 
0 


So we may write 


2 L 
== / f(x) cos|(25 — 1)ra/(2L)] de 
0 
We may similarly analyze the Sturm—Liouville problem 
y’ +rA(y =0 


on the interval [0, £] with boundary conditions y(0) = y’(L) = 0. For this 
setup, the eigenfunctions (this is an exercise for you) are 


(27 — 1)rx 


yj (x) = sin 5 


and the series expansion for a continuous function f on (0, L] is 


_ = . (29-1)re 
x“) = ys aj sin ——>— 
g=l1 
with 


. . 
aj = if f(x) sin[(27 — 1)ra/(2L)] dx. a 


Exercises 


1. Explain how the equation 


4 + a(x) + (r8(@) — e))y = 0 
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can be written in the form of a Sturm—Liouville equation by setting 


p(x) = exp ([ ewer) 


Describe q(a) and r(x) in terms of a(x), G(x), and 7(x). 
2. Find a way to write the generalized Legendre equation 


dy m? 
1-27) —2 = 29— 1) - — Js] y= 
a-07) ots (nine) a) 0 


in Sturm—Liouville form. 

3. Consider the Sturm-—Liouville problem y” + Ay = 0 with endpoint con- 
ditions y(0) = 0 and y(7) + y’(7) = 0. Show that there is an infinite se- 
quence of eigenfunctions with distinct eigenvalues. What are those eigen- 
values? 

4. Show that, for any eigenvalue , the Mathieu equation has either an odd 
eigenfunction or an even eigenfunction. 

5. Consider the equation y” + Ay = 0. Verify that the eigenvalues for the 
endpoint conditions 


(a) y(0) = y(m) =0 

(b) y(0) =y'() =0 

(c) y/(0) =y(r) =0 

(d) y/(0) =y'(r) =0 

are, respectively, {k?}, {(k+1/2)?}, {(k+1/2)?}, {k?}. Can you find the 
eigenfunctions? 


6. Find all eigenvalues \ so that the differential equation y’”” + Ay = 0 has 
a nontrivial eigenfunction satisfying y(0) = y’(0) = y(m) = y'(7) = 0. 


a 
4.3 Applications of the Sturm—Liouville Theory 


In the last section of this chapter, we explore applications of Sturm—Liouville 
theory to quantum mechanics. In the present section we look at more basic 
applications to classical physics. 

Perhaps the most classical application to mechanics is the vibrating string. 
We treated the vibrating string in detail in Section 2.5, so we shall not repeat 
those ideas here. 

A second application is to the study of the longitudinal vibrations of an 
elastic bar of local stiffness p(a) and density q(x). The mean longitudinal 
displacement w(z, t) of the section of this bar from its equilibrium position x 
satisfies the wave equation (see Section 10.2) 


SE = 2 [in]. 
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The simple harmonic vibrations given by the separation of variables 
u = v(x) cos k(t — to) 
are in fact the solutions of the Sturm—Liouville equation 
d du 2 
aa nay | + k*p(x)u=0. 


Note that this new equation is in fact the special instance of our original 
equation (4.1.1) with r =0 and A = k?. 

For a finite bar, represented mathematically by the interval [a,b], there 
are various natural sets of endpoint conditions: 


e u(a) = u(b) =0 (rigidly fixed ends), 

e uw (a)=u'(b) = (free ends), 

e uw (a)+au(a) = u'(b) + Bu(b) =0 (elastically held ends), 
e u(a) = u(b), u’(a) = u'(b) (periodic constraints). 


Each one of these endpoint conditions on u implies a corresponding condi- 
tion on the original function u. The natural frequencies of longitudinal vibra- 
tion (musical fundamental tones and overtones) of a bar whose ends are held 
in one of the four manners described above are solutions of a Sturm—Liouville 
system. 

Finally we briefly discuss a (two-dimensional) vibrating membrane. The 
partial differential equation describing this situation is (with x, y being spatial 
coordinates in the plane and t being time) 


Wit = C2(Wae + Wyy) - 
In polar coordinates this would be 
Wie = C2(Wrr +r tw, +r 2wee) - 


A basis of standing wave solutions can be found with the separation of vari- 
ables 


I 


w(r, 6, t) u(r) cos n6 cos K(t — to) 


w(r,6,t) = u(r)sinnécos k(t — to). 


For w to satisfy the membrane equation with « = ck, it is enough for wu to be 
a solution of the Bessel equation (4.2.1.1). 

If the membrane we are studying is a disc with radius a, such as a vibrating 
drumhead, then the physically natural boundary conditions for this problem 
are u(a) = 0 and u(0) nonsingular. These conditions characterize the Bessel 
functions among other solutions to the Bessel equation (up to a constant 
factor). 
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EXAMPLE 4.3.1 Consider a metal bar of length L, density 6, and cross- 
sectional area A. Further assume that the bar has tensile modulus! E. The 
bar has total mass MOAL. 

The end x = 0 of the bar is fixed, and a mass m is attached to the end 
x = L. We initially stretch the bar linearly by moving the mass m a distance 
d = bL to the right. At time t = 0 the mass is released and the system is set 
in motion. We wish to determine the subsequent vibrations of the bar. 

So we need to solve the boundary value problem 


Unt = O-Upe forO<a<Landt>0 


with boundary conditions 


u(0,t) = O 
mun(L,t) = —-AEu,(L,t) 
u(z,0) = ba 
u(z,0) = 0. 


Note that the differential equation is the wave equation, which we study in 
detail in Section 10.2 below. The first, third, and fourth boundary conditions 
are physically obvious. The second one takes into account what the tensile 
modulus represents. Put in other words, what we are doing in this equation 
is equating ma = mu, for the mass m and the force F = —AEug. 

Following the method of separation of variables, we set 


u(a,t) = X(x)- T(t) 


and substitute this expression into the partial differential equation. This leads, 
as usual, to the two ordinary differential equations 


X" +X =0 (4.3.1.1) 


and 
T +re°T =0. (4.3.1.2) 


We know that 
0 = u(0,t) = X(0)T(t) 
so that X (0) = 0. Since uz = X(x)T" (t) and uz = X'(x)T(t), we can interpret 
the second boundary condition as 


mX(L)T"(t) = -AEX"(L)T(t). (4.3.1.3) 


The tensile modulus or Young’s modulus is a measure of the stiffness of an elastic 
material and is a quantity used to characterize materials. It is defined to be the ratio of 
the stress (force per unit area) along an axis to the strain (ratio of deformation over initial 
length) along that axis in the range of stress in which Hooke’s law holds. 
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But equation (4.3.1.2) tells us that 
T(t) = —ra? T(t) 


and a little physical analysis shows that a? = E’/6. So the last line gives 
T(t) = -—Ti(t). (4.3.1.4) 


Now we can substitute equation (4.3.1.4) into (4.3.1.3) to obtain 


mX(L)A2T = AEX'(L)T. 


Dividing through by ET’/6 now finally yields 
mAX(L) = AbX'(L). 


Thus we conclude that the eigenvalue problem for X (2) is 


X"+XX =0 
with the boundary conditions 
X(0) = 0 
mAX(L) = AdX'(L). 


This is not a standard Sturm-Liouville problem, just because of the pres- 
ence of in the endpoint condition for L. It can be shown, however, that 
all the eigenvalues are positive. So we may write \ = 3? and observe that 
X(x) = sin Gx satisfies X (0) = 0. 

Now our right endpoint condition yields 


m3? sin BL = AdBcos BL. 


Therefore ie og 
anpre = 
me BL 

Here M = AdL. Now set y = GL. Then the eigenvalues and eigenfunctions of 
our system are 

2 
bots 
ae 
for 7 = 1,2,... and 7; are the positive roots of the equation tan x = (M/m)/c. 

Now we perform our usual analysis on the equation 


VIX 


Xj and X,;(x) = sin ie 


ya? 
lL? 


i eee Toe 
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with endpoint condition T;(0) = 0 to find that 


T;(t) = cos (=) 


up to a multiplicative constant. 
What we have left to do now is to find coefficients a; so that the series 


oS yjat . 5x 
,t)= ; cos —— sin —— 4.3.1.5 
u(x, t) 2 cos —— sin —- ( ) 


satisfies the condition 
= eo I _ 
u(z,0) = ) a; sin ~— = f(x) = be. (4.3.1.6) 


Since our problem is not a Sturm—Liouville problem, it turns out that the 
eigenfunctions are not orthogonal. So some extra care will be required. 

In practice, from the physical point of view, we are not so much interested 
in the displacement function u(x, t) itself. Rather, we care about how the bar’s 
natural frequencies of vibration are affected by the mass m on the free end. 
Whatever the coefficients a; in equation (4.3.1.6) may turn out to be, we find 
that equation (4.3.1.5) tells us that the jth circular frequency is 


where the 7; are defined as above. This last may be rewritten as 


i Max 
cotz = ——. 
M 


We conclude then that the natural frequencies of this system are deter- 
mined by the ratio of the mass m to the total mass M of the bar. a 


a 


Exercises 


1. Consider the problem 
Ut=Urze , O<a<Lt>0, 
with endpoint conditions 


uaz (0, t) = hu(L,t) + uz(L,t) = 0 
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and 

u(x,0) = f(x). 
Find a series solution of this system. 
[Hint: Your solutions should have the form 


oo 2 kt . 
u(x,t) = SS exp (-% ) cos Fit ; 


Here the 3; are the positive roots of the equations tanxz = hL/z.] 


2. Consider the equation 
Urxe tUyy =0,0<a4<L,0<y<L, 
with the endpoint conditions 
u(0, y) = hu(L, y) + ue(L,y) = 0 


and 
u(x, L) =0 


and 
u(x, 0) = f(z). 
Find a solution as in Exercise 1. 


3. Consider the equation 
Ure = Uyy =0, 0<a<L, y>O, 
with the endpoint conditions 


u(0, y) = hu(L, y) + u2(L,y) = 0 


and 

u(x, 0) = f(x) 
and in addition u(x, y) is bounded as y > +oo. Find a solution as in 
Exercise 1. 


4. Consider the heat equation 
wu =kues, O<a<L,t>0, 

with the endpoint conditions 

hu(0,t) — u2(0,t) = 0 
and 

hu(L, t) + ua (L,t) = 0 
and 

u(x,0) = f(x). 

Find a solution as in Exercise 1. 


5. Calculate the speed in miles per hour of the longitudinal sound wave in 
the following problem. The medium is water, 6 = 1g/ cm’, and the bulk 
modulus is K = 2.25 x 10'° in cgs units. 
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6. A problem concerning the diffusion of a gas through a membrane leads 
to the boundary value problem 


ut =kure , OX a2<L,t>O0, 


with endpoint conditions 


u(0, t) =0, 
ut(L, t) + hkus(L,t) =0, 
u(z,0) =1. 


If @; are the positive roots of the equation x tan x = AL, then derive the 


solution : 
oe 2 kot : 
u(a,t) = 28 exp (- a ) sin oie : 
7. Show that the equation 
dy 


d’y 
can be written in Sturm—Liouville form. Do so by defining p(x) = 
e/ A(®) 4" What are q and r in terms of A, B, and C? 


TT 


4.4 Singular Sturm—Liouville 


In a regular or standard Sturm—Liouville problem we assume that p and q are 
nonvanishing on the entire closed interval [a,b]. In a singular Sturm—Liouville 
problem we allow vanishing at the endpoints, and this gives rise to many new 
phenomena. And many of these actually come up in physical situations. We 
consider some of them here. 


EXAMPLE 4.4.1 Consider the differential equation 
xy’ +y' +Ary=0. (4.4.1.1) 


We can rewrite this as 
—(ry')' = Ary, (4.4.1.2) 


and we assume that 0 < x < 1 and that » > 0. This equation arises in the 
study of a disc-shaped elastic membrane. 
If we introduce a new independent variable defined by t = VAz, then we 
we d d d? d? 
Y Y Yy ¥ 
ge 2 ae: 
Thus equation (4.4.1.1) becomes 


au dy t 
J a 0 
Wow atta! 
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or, simplifying, 
dy | dy 
t— + —4+ty=0. 4.4.1. 
TP + it +ty =0 ( 3) 
This last equation is Bessel’s equation of order 0. See Chapter 3 and the first 
part of Chapter 4. 


The general solution of this last equation is 
y = Ci Jo(t) + C2Yo(t) , 


where Jo is the Bessel function of the first kind and Yo is the Bessel function 
of the second kind. Hence the general solution of (4.4.1.2) is 


y = C1 Jo(V Ax) + C2Yo(V Az) - (4.4.1.4) 


It is known that 


a 23 


and 


2 Vix oe 1)9+1 Hd) 
— In — } Jo( —— >0. 
Yo(V Ac) = — (r1m4 a DIG!) , 
Here H; = 1+ (1/2)+---+ (1/9) and y = lim,_,..(H; — In j).? 
Now suppose that we seek a solution to (4.4.1.2) that satisfies the endpoint 
conditions 


(4.4.1.5) 
(4.4.1.6) 


Because Jo(0) = 1 and Yo(a) — —co as a — 0, we see that y(0) = 0 can 
hold only if C; = 0 and C2 = 0 in equation (4.4.1.4). So the boundary value 
problem given by (4.4.1.2), (4.4.1.5), (4.4.1.6) has only the trivial solution. 

We may endeavor to understand this situation by hypothesizing that the 
endpoint condition (4.4.1.5) is too restrictive for the ordinary differential equa- 
tion (4.4.1.2). This illustrates the idea that, at a singular point for the Sturm— 
Liouville problem, we need to consider a modified boundary condition. What 
we will do then is, instead of our usual endpoint conditions, we shall require 
that the solution (4.4.1.4) and its derivative remain bounded. That is to say, 
our boundary condition at « = 0 will now be 


y,y remain bounded as x — 0. 


2These are standard constructions in the subject of special functions. The ideas are due 
to Euler. 
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This last condition can be achieved by choosing Cp = O in Equation 
(4.4.1.4). For this eliminates the unbounded solution Yo. The second boundary 
condition y(1) = 0 now gives us that 


Jo(Vd) = 0. (4.4.1.7) 


It is possible to show that Equation (4.4.1.7) has an infinite set of positive 
roots,’ and this gives the sequence 


Ay <Ag<cee. 


Corresponding eigenfunctions are 
5 (x) => Jo (Vie) . 


Thus we see that the boundary value problem (4.4.1.2), (4.4.1.5), (4.4.1.6) 
is a singular Sturm—Liouville problem. We have learned that, if the boundary 
conditions for such a singular problem are relaxed in a suitable fashion, then 
we may find an infinite distinct sequence of eigenvalues and corresponding 
eigenfunctions—just as for a regular Sturm—Liouville problem. | 


Now we give some other quick examples of singular Sturm—Liouville just 
to show how naturally they fit into the context that we have been studying. 


EXAMPLE 4.4.2 The Legendre equation is given by 
(1—2*)y" — 2Qey’ + €(€+1)y=0, 2e[-1,1]. 


We can rewrite the equation as 
———|(1— 2? )y’] = e+ ly. 


Here p(x) = 1 — 2?. Since p(—1) = p(1) = 0, we see that the problem is 
singular. |_| 


EXAMPLE 4.4.3 Chebyshev’s equation is given by 
(l—2?)y” —ay’+n?y=0, x €[-1,1]. 


Dividing by V1 — x7, we can convert the equation to Sturm—Liouville form: 


2 
x i 7 
Var’ ‘aoe 


3The roots of Jo are very well understood. The first three roots VX are 2.405, 5.520, and 
8.654 (to four significant places). For j large, it is known that \/A; ¥ (j — 1/4)r. 


2, 


1—2*y 
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or 

d | 1 | oo wale 

dx Vian 7 Ta 
We see that p(x) = (1— x?)~!/? is singular at +1, so this is a singular Sturm— 
Liouville problem. | 


EXAMPLE 4.4.4 The Hermite equation is given by 
y” —2ry'+2ay=0, xe (—co,00). 


Multiplying through by e-*” we can recast the equation in Sturm—Liouville 


form: 
dp _.2, oes 
a le y'| =2ae™” y. 


This is a singular Sturm—Liouville problem because the interval is infinite. 


EXAMPLE 4.4.5 Lagrange’s equation is given by 
ry’ +(1—2)y’+ay=0, 2x € [0,0o). 


This can be converted to Sturm—Liouville form by multiplying through by 
e_*. The result is 


d -—2£ —2@ 
=a. [ze“y'] = ae~*y. 
Note that p(x) = xe~*. This is a singular Sturm—Liouville problem for two 
reasons: (i) p(0) = 0 and (ii) the interval is infinite. a 


Singular Sturm—Liouville problems arise quite frequently in applications 
and it is worthwhile to study them further. Natural questions that one would 
like to have answered are 


e What types of boundary conditions are allowable in a singular Sturm-— 
Liouville problem? 


e To what degree do the eigenvalues and eigenfunctions of a singular Sturm— 
Liouville problem behave like the eigenvalues and eigenfunctions of a regular 
Sturm-—Liouville problem? In particular, are the eigenvalues all real? Do 
they form a discrete set? Do they tend to infinity? Are the eigenfunctions 
orthogonal? Can a given function be expanded in a convergent series of 
eigenfunctions? 
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The key to answering these questions is to study the equation 


| [L(u)u — uL(v)| dx =0. (4.4.6) 


To be specific, we shall study the situation in which the endpoint condition at 
0 is singular but the endpoint condition at 1 is not. We will by default assume 
that the endpoint condition at 1 is 


byy(1) + bay’(1) =0. (4.4.7) 


We shall spend some time developing what will be an appropriate singular 
boundary condition at 0. 

One thing we must allow for is the possibility that the integral in (4.4.6) 
is singular at 0. Thus it is appropriate to instead consider fe for € > 0 small. 
At the end we let « — 0+. We assume that u, v each have two continuous 
derivatives. Thus we may integrate by parts in (4.4.6) and find that 


1 1 
‘) [L(u)u — uL(v)] dx = —p(x)[u'(x)v(x) — u(x)v'(x)]] (4.4.8) 

If both wu and v satisfy the boundary condition (4.4.7), then the boundary 

term on the right-hand side of (4.4.8) at « = 1 is 0. Thus (4.4.8) becomes 


i [L(u)v — uL(v)] daz = —p(e)[u'(e)v(e) — u(e)u'(e)] . 


Letting « — 0 then gives 


[ [L(u)v — uL(v)] dr = lim —p(€)[u’(e)v(e) — u(e)v’(e)] . 


Thus ‘ 
ih fA ON en (4.4.9) 


if and only if, in addition to the other technical hypotheses enunciated above, 
we have 
lim —p(e)[u’(€)v(e) — u(e)v’(e)] = 0 (4.4.10) 


e—0 

for every u and v in the class functions we are considering. In conclusion, 
equation (4.4.10) is our criterion for determining what conditions are allowed 
in order for 0 to be a singular boundary point. A similar condition applies at 
the boundary point 1. 

To sum up: A singular value problem for the Sturm—Liouville equation is 
said to be self-adjoint if condition (4.4.9) is satisfied—possibly as an improper 
integral—for each pair of functions u and v selected as follows: 


(a) Each of u and v is twice continuously differentiable on (0, 1). 
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(b) 
(c) 


Notice, for instance, in the example that we worked out above, that p(x) = 
x vanishes at 0. So 0 is a singular boundary point. But it is perfectly clear that 
Equation (4.4.10) holds. So that singular Sturm—Liouville problem is certainly 


They satisfy a boundary condition of the form b,y(1) + bey’(1) = 0 
at each regular boundary point. 


They satisfy a boundary criterion like (4.4.10) at each singular 
boundary point. 


self-adjoint. 


a 


Exercises 


1. 


Find a formal solution of the nonhomogeneous boundary value problem 


—(xy')’ = way + f(a), 


where y and y’ are bounded as x — 0 and y(1) = 0. Also, f is a contin- 
uous function on [0,1] and yw is not an eigenvalue of the corresponding 
homogeneous problem. 


The equation 


(1—2?)y" — ay! + Ay =0 
is Chebychev’s equation. 


(a) Show that this ODE can be written in the form 
-[—2?)/?y)' =A —2?)/?%y, -1<a<l. (x) 
b) Consider the boundary conditions 
y 


y, y bounded as > —1 (+) 


and 
y, y bounded as x > 1. (22) 


Show that the boundary value problem given by (x), (**), («**) is 
self-adjoint. 

(c) It can be verified that the boundary value problem given by (x), (**), 
(***) has the eigenvalues Ao = 0, A1 = 1, A2 = 4, and in general A; = 
j*. The corresponding eigenfunctions are the Chebyshev polynomials 
T(x), where To(x) = 1, Ti(x) = x, T2(x) = 1 — 2x7, and so forth. 


Prove that ; 
T3(z)Tk(x) 
[tienen 


for 7 # k. Technically speaking, this is a convergent improper inte- 
gral. 
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3. Consider Legendre’s equation 


—[(L- 2" )y'! = dy. 


We subject this equation to the boundary conditions y(0) = 0 and y and 
y’ are bounded as x — 1. The eigenfunctions of this problem are the 
Legendre polynomials ¢o0(x) = 1, ¢1(@) = Pi(x) = a, ¢o(x) = P3(x) = 
(5a? — 3x) /2, and in general $;(x) = P2;-1(a). Here 


1 d” 2\n 
P,(«) = —— — 
(2) 2” n! dx” K ) 
The ¢; correspond to the eigenvalues \1 = 2, A2 = 4-3, ..., Aj = 


2j(27 — 1). 
(a) Show that 
1 
| es(@)oxla) de = 0 
0 
when j # k. 
(b) Find a formal solution of the nonhomogeneous problem 
[(1—2*)y'! = wy + f(e), 


where y(0) = 0, and y, y’ are bounded as x — 1. Also, f is a continu- 
ous function on [0,1], and yp is not an eigenvalue of the corresponding 
homogeneous problem. 


4. Show that the eigenvalues of the singular Sturm—Liouville system defined 
by 
d 
dx 
with the additional condition that u be bounded on (—1,1), are given by 
Aj = 5/G + 2a). 

5. Refer to Exercise 3 for terminology. Show that the Legendre polynomials 
(and their constant multiples) are the only solutions of the Legendre ODE 
that are bounded on (—1, 1). 

6. Show that the Sturm—Liouville system given by 


la os one +X1—2?)*u=0, a> -1, 


[(x@ — a)(b— x)u']’ + ru =0 


for a < b, with u(x) bounded on (a,b), has eigenvalues \ = 4j(j + 1)(b— 
a)*. Describe the eigenfunctions. 


7. Consider the singular Sturm—Liouville system 
(xe~*u')’ + Ae *u =0 


on the interval (0,-++co). We impose the endpoint conditions that u(0*) 
be bounded and further that e~”u(x) — 0 as x — +oo. Is it true that 
the values A = j give polynomial eigenfunctions? 
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(I 
4.5 Some Ideas from Quantum Mechanics 


Sturm—Liouville problems arise in many parts of mathematical physics—both 
elementary and advanced. One of these is quantum mechanics. Here we see 
how the study of matter on a very small scale can be effected with some of the 
ideas that we have been learning. We derive our exposition from the lovely 
book [KBO]. 

We think of a system as being specified by a state function w(r,t). Here 
r represents a position vector for a point in space (of dimension one, two, 
or three), and t is time. We think of ~ as a probability distribution, so it is 
appropriate to assume that 


1= (0) = [flee OP ar. 


That is, for each fixed time t, w has total mass 1. 
One of the basic tenets of quantum mechanics is that, in each system, 
there is a linear operator H such that 


Here h is a constant specified by the condition that 
Ith = 6.62-10-16 J - s. 


Here J-s stands for Joule seconds. The actual value of the constant h is of no 
interest for the present discussion. The operator H has the nice (Hermitian) 
property that 

(Hy1,y2) = (yi, Hye) - 


Finally—and this is one of the key ideas of von Neumann’s model for 
quantum mechanics—to each observable property of the system there cor- 
responds a linear, Hermitian operator A. Moreover, any measurement of the 
observable property gives rise to an eigenvalue of A. For example, as you learn 
in your physics course, the operator that corresponds to momentum is —ihV 
and the operator that corresponds to energy is ihO/0t. 

Now that we have dispensed with the background, let us examine a specific 
system and see how a Sturm-—Liouville problem comes into play. Consider 
a particle of mass m moving in a potential field V(r,t). Then, if p is the 
momentum of the particle, we have that 

2 
E=" 40v¢,¢). (4.5.1) 


2m 


Observe that the first expression on the right is the kinetic energy and the 
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second expression is the potential energy. Now we quantize this classical rela- 
tion by substituting in appropriate operators for the potential and the energy. 
We then obtain ay 
ies 2 py + V(r, t)y. (4.5.2) 
Ot 2m 
This important identity is known as Schrédinger’s equation. It controls the 
evolution of the wave function. 
To simplify matters, let us suppose that the potential V does not depend 
on time t. We may then seek to solve (4.5.2) by separation of variables. Let us 
write ~(r,t) = a(r)T(t). Substituting into the differential equation, we find 


that 
2 


iRa(e) = ~ Va -T4+V(r)a(r)T 
Z(aT/di(t) 
ih Te) ale)” (r) + V(r). (4.5.3) 


Observe that the left-hand side depends only on t, and the right-hand side 
only on r. Thus both are equal to some constant j. 
In particular, 


So we may write 


7 Ov/at 7 —dT /dt Z 
a v =4 pS iv 
hence 
OW 


ih— = ww. 
ie, pu 


Thus we see that js is the energy of the particle in our system. 
Looking at the right-hand side of (4.5.3), we now see that 


2 


-+ Va iVG— Dex, (4.5.4) 


This is the time-independent Schrodinger equation. It will turn out, contrary to 
the philosophy of classical physics, that the energy of this one-particle system 
must be one of the eigenvalues of the boundary value problem that we shall 
construct from equation (4.5.4). The energy is said to be quantized. 

To consider a simple instance of the ideas we have been discussing, we 
examine a particle of mass m that is trapped in a region of zero potential by 
the infinite potentials at « = 0 and x =a. See Figure 4.1. Let us consider the 
possible energies that such a particle can have. 

Thinking of w as a (continuous) probability distribution, we see that 7 = 0 
outside the interval (0,a), since the probability is zero that the particle will 
be found outside the interval. Thus the graph of w is as in Figure 4.2. 
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y 


FIGURE 4.1 
Particle trapped in a region of zero potential. 


FIGURE 4.2 
Graph of the continuous probability distribution w. 
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Thus our eigenvalue problem simplifies to 
h aa 

ee ey 

2m ou BO 


subject to the boundary conditions a(0) = a(a) = 0. Of course this is a 
familiar problem. Observe that the role of y: in that example is now being 
played by 2my/h? and the role of the function y is now played by a. Thus 


we find that the allowed energies in our system are simply h /2m times the 


eigenvalues, or fin = Re n2n? /[2ma?]. 


SS 


Problems for Review and Discovery 


A. Drill Exercises 


1. Solve the equation 


3 
W Qn += / Ay = 
y (2e ape y=0 


by finding a simple solution of the adjoint equation. 


2. Find the eigenvalues and eigenfunctions for the problem 
u'+Mu=0 ,0<a<a 


with the endpoint conditions u(0) = 0, u’(a) = 0. 
3. Find the eigenvalues and eigenfunctions for the problem 


u'+Mu=0 ,0<a<a 
with the endpoint conditions u’(0) = 0, u(a) = 0. 
4. Find the eigenvalues and eigenfunctions for the problem 
u'+Mu=0 ,0<a<a 
with the endpoint conditions u(0) — u’(0) = 0, u(a) + u’(a) = 0. 
5. Show that the eigenfunctions for the equation 
u" +M2(14+a)u=0 


with u(0) = 0, u’(a) = 0 are orthogonal. 
6. Show that the eigenfunctions for the equation 
2 
” 


» 
Os 


with u(1) = 0, u’(2) = 0 are orthogonal. 
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B. Challenge Problems 


1. 


The equation 
xy” oh xy’ _ ny =0 
is Euler’s equidimensional equation. Here n is a positive integer. Deter- 


mine those values of n for which the equation is exact (meaning that 


the equation can be written in the form 4[A(zx)y’ + B(x)y]). For those 


values, find the general solution of the equation. 
Show that a self-adjoint differential equation can be written in the form 


[P(x)y’]’ + R(x)y =0. 


This is called the standard form for a self-adjoint equation. 
Show that any equation 


P(x)y" + Q(x)y' + R(a)y = 0 
can be made self-adjoint by multiplying through by the factor 


1 S(Q/P) ae 
P 


Legendre’s equation 
(1—2*)y" — 2ay’ + p(p +1) =0 


is self-adjoint. Put it into standard form as described in Exercise 2. 


Bessel’s equation 


xy" + ay’ + (x —p*)y =0 
is not self-adjoint. Use the technique of Exercise 3 to make it self-adjoint, 
and then put the equation in standard form as described in Exercise 2. 
Hermite’s equation 


” 


y” — Qay' + (x + 2py =0 


is not self-adjoint. Use the technique of Exercise 3 to make it self-adjoint, 
and then put the equation in standard form as described in Exercise 2. 


C. Problems for Discussion and Exploration 


1. 


Consider the differential equation y” + Ay = 0 with the endpoint con- 
ditions y(0) = 0 and y(m) + y'(7) = 0. Show that there is an infinite 
sequence of eigenfunctions with distinct eigenvalues. Identify the eigen- 
values explicitly. 

In what sense is the theory of Sturm—Liouville equations a generalization 
of the Fourier theory developed in Chapter 6? Answer this question in 
as much detail as you can. Where do the exponentials e’? come from? 
From which Sturm-—Liouville problem? 

Read about the Fourier transform in the last section of Chapter 6. Now 
determine which functions f have the property that f = f. If, instead 
of using the notation, we denote the Fourier transform by F, then 
show that Fo FoF of(f) = f for any function f. Conclude from this 
last calculation that there should be a function g with F(g) = —g anda 
function h with F(h) = th and a function k with F(k) = —tk. 


Taylor & Francis 
Taylor & Francis Group 
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Numerical Methods 


The idea of a numerical method 


e Approximation 


Error terms 


e Euler’s method 


e Improved Euler method 


Runge-Kutta method 


The presentation in this book, or in any standard introductory text on 
differential equations, can be misleading. A casual reading might lead the stu- 
dent to think that “most” differential equations can be solved explicitly, with 
the solution given by a formula. Such is not the case. Although it can be proved 
abstractly that most any ordinary differential equation has a solution—at least 
locally—it is in general quite difficult to say in any explicit manner what the 
solution might be. It is sometimes possible to say something qualitative about 
solutions. And we have also seen that certain important equations that come 
from physics are fortuitously simple, and can be attacked effectively. But the 
bottom line is that many of the equations that we must solve for engineering 
or other applications simply do not have closed-form solutions. Just as an 
instance, the equations that govern the shape of an airplane wing cannot be 
solved. Yet we fly every day. How do we come to terms with the intractability 
of differential equations? 

The advent of high-speed digital computers has made it both feasible and, 
indeed, easy to perform numerical approximation of solutions. The subject 
of the numerical solution of differential equations is a highly developed one, 
and is applied daily to problems in engineering, physics, biology, astronomy, 
and many other parts of science. Solutions may generally be obtained to any 
desired degree of accuracy, graphs drawn, and any desired analysis performed. 

Not surprisingly—and like many of the other fundamental ideas related 
to calculus—the basic techniques for the numerical solution of differential 
equations go back to Newton and Euler. This is quite amazing, for these men 
had no notion of the computing equipment that we have available today. Their 
insights were quite prescient and powerful. 
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In the present chapter, we shall only introduce the most basic ideas in the 
subject of numerical analysis of differential equations. We refer the reader to 
[GER], [HIL], [ISK], [STA], [TOD] for further development of the subject. 


(I 


5.1 Introductory Remarks 


When we create a numerical or discrete model for a differential equation, we 
make several decisive replacements or substitutions. First, the derivatives in 
the equation are replaced by differences (as in replacing the derivative by a 
difference quotient). Second, the continuous variable x is replaced by a discrete 
variable. Third, the real number line is replaced by a discrete set of values. 
Any type of approximation argument involves some sort of loss of information; 
that is to say, there will always be an error term. It is also the case that these 
numerical approximation techniques can give rise to instability phenomena 
and other unpredictable behavior. 

The practical significance of these remarks is that numerical methods 
should never be used in isolation. Whenever possible, the user should also 
employ qualitative techniques. Endeavor to determine whether the solution is 
bounded, periodic, or stable. What are its asymptotics at infinity? How do 
the different solutions interact with each other? In this way the scientist is 
not using the computing machine blindly, but is instead using the machine to 
aid and augment his/her understanding. 

The spirit of the numerical method can be illustrated with a basic exam- 
ple. Consider the simple differential equation 


y=y, y(0)=1. 


The initial condition tells us that the point (0,1) lies on the graph of the 
solution y. The equation itself tells us that, at that point, the slope of the 
solution is 
y=y=l. 

Thus the graph will proceed to the right, with slope 1. Let us assume that we 
shall do our numerical calculation with mesh 0.1. So we proceed to the right 
to the point (0.1, 1.1). This is the second point on our “approximate solution 
graph.” 

Now we return to the differential equation to obtain the slope of the 
solution at this new point. It is 


y =y=11. 


Thus, when we proceed to sketch our approximate solution graph to the right 
of (0.1, 1.1), we draw a line segment of slope 1.1 to the point (0.2,1.21). And 
so forth. See Figure 5.1. 
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FIGURE 5.1 
A simple approximation scheme. 


Of course this is a very simple-minded example, and it is easy to imagine 
that the approximate solution is diverging rather drastically and unpredictably 
with each iteration of the method. In subsequent sections we shall learn tech- 
niques of Euler (which formalize the method just described) and Runge-Kutta 
(which give much better, and more reliable, results). 


5.2 The Method of Euler 


Consider an initial value problem of the form 


y’ = f(z,y), y(Xo) = Yo- 


We may integrate from x9 to x1 = 29 +h to obtain 


ular) — (vo) = f vide f(x,y) dx 


x 


0 


or 
xy 


y(@1) = y(%o) + f(x,y) dz. 


xo 
Since the unknown function y occurs in the integrand on the right, we cannot 
proceed unless we have some method of approximating the integral. 
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The Euler method is obtained from the most simple technique for approxi- 
mating the integral. Namely, we assume that the integrand does not vary much 
on the interval [vo, x1], and therefore that a rather small error will result if 
we replace f(z, y) by its value at the left endpoint. To wit, we put in place a 
partition a = xp < 4 < 4 <-++: < x, = 0 of the interval [a,b] under study. 
We assume that the x; are equally spaced, with |x; — #;-1| = h for every j. 
We set yo = y(xo). Now we take 


xy 


y(t1) = y(xo) + f(x,y) dx 


xo 
m=  y(ao) + f (£0, yo) da 


eae) 


= yoth- f(xo, yo). 


Based on this calculation, we simply define 


Y¥1 = Yo +h- f(xo, yo) - 


Continuing in this fashion, we think of x, = x,_1 + h and define 


Yre1 = Ue + h- f(eayye) 


Then the points (xo, yo), (1, Y1),---;(@k,Yk),--- are the points of our “ap- 
proximate solution” to the differential equation. Figure 5.2 illustrates the exact 
solution, the approximate solution, and how they might deviate. 

It is sometimes convenient to measure the total relative error FE, at the 
nth step; this quantity is defined to be 


mr _ ly(zn) — Ynl 
Fe ele 


We usually express this quantity as a percentage, and we obtain thereby a 
comfortable way of measuring how well the numerical technique under con- 
sideration is performing. 

Now we are going to focus on a particular ordinary differential equation 
that will be the benchmark for all of our numerical techniques. Throughout 
this chapter, we are going to examine the initial value problem 


y=aty, y(0)=1 


over and over again using different methods of numerical analysis. Our bench- 
mark will be to calculate y(1) numerically and compare it with the exact value 
of y(1) that we may obtain by an explicit solution method. 


EXAMPLE 5.2.1 Apply the Euler technique to the ordinary differential equa- 
tion 
y =axrt+y, y(0) =1 (5.2.1.1) 


using increments of size h = 0.2 and h=0.1. | 
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Error at second step 


Errof at first step 


FIGURE 5.2 
Euler’s approximation scheme. 


Table 5.2.1.2. Tabulated values for exact and numerical 
solutions to equation (5.2.1.1) with h = 0.2 


Ln Yn Exact En, (%) 
0.2 1.20000 1.24281 3.4 
0.4 1.48000 1.5836 6.5 
0.6 1.85600 2.04424 9.2 
0.8 2.720 2.65108 11.5 
1.0 2.97664 3.43656 13.4 


Solution: Of course we use the fact that an explicit solution of the differential 
equation (which is first-order linear) with initial condition is given by 


y= —-“%—1+2e”. 


We exhibit the calculations in Table 5.2.1.2. In the first line of this table, 
the initial condition y(0) = 1 determines the slope y’ = x+y = 1.00. Since h = 
0.2 and y; = yoth-f (xo, yo), the next value is given by 1.00+0.2-(1.00) = 1.20. 
This process is iterated in the following lines. As noted above, the expression 
E,, represents the percent of error. For instance, in the second line of the table 
it is calculated as 

— 1.24281 — 1.2 


1 = 794981 = .034446... 
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and we represent the quantity in the table as a percent—so it is 3.4%. We 
shall retain five decimal places in this and succeeding tables. 

For comparison purposes, we also record in Table 5.2.1.3 the tabulated 
values for h = 0.1. That is the solution of this example. | 


The displayed data makes clear that reducing the step size will increase 
accuracy. But the trade-off is that significantly more computation is required. 
In the next section we shall discuss errors, and in particular at what point 
there is no advantage to reducing the step size. 


Table 5.2.1.3. Tabulated values for exact and numerical 
solutions to equation (5.2.1.1) with h = 0.1 


Ln Yn Exact En (%) 
0.0 1.00000 1.00000 0.0 
0.1 1.10000 1.110 0.9 
0.2 1.22 1.24281 1.8 
0.3 1.362 1.39972 2.7 
0.4 1.52820 1.581 3.5 
0.5 1.72102 1.79744 4.3 
0.6 1.94312 2.04424 4.9 
0.7 2.19743 2.32751 5.6 
0.8 2.48718 2.65108 6.2 
0.9 2.81590 3.01921 6.7 
1.0 3.18748 3.41 7.2 


Exercises 


For each of Exercises 1-5, use the Euler method with h = 0.1,0.05, and 0.01 to 
estimate the solution at z = 1. In each case, compare your results to the exact 
solution and discuss how well (or poorly) the Euler method has worked. 


1. y =2r+2y, y(0)=1 

2. y=1/y, y(0)=1 

3. y’=e", y(0) =0 

4, y'’=y-sinz, y(0)=-1 

5. y'=(a@t+y—1), (0) =0 

6. Refer to Figure 5.2. Use geometric arguments to determine for what kind 


of exact solutions the Euler method would give accurate results. Do these 
results depend on fh in any way? Construct two different examples to 
illustrate your point. 
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7. The ordinary differential equation 
y =y(1—-y’) 
possesses two equilibrium solutions: the solution ¢; = 0, which is un- 
stable, and the solution @2 = 1, which is stable. With the initial condi- 
tion y(0) = 0.1, predict what should happen to the solution. Then, with 
h = 0.1, use the Euler method to run the solution out to x = 3. What 
happens to this numerical solution? 


8. This exercise illustrates the danger in blindly applying numerical tech- 
niques. Apply the Euler method to the following initial value problem. 


y’ = sec’ x, y(0) =0. 


Use a step size of h = 0.1 and determine the numerical solution at x = 1. 
Now explain why the initial value problem actually has no solution at 
el. 


TT 


5.3. The Error Term 


The notion of error is central to any numerical technique. Numerical methods 
only give approximate answers. In order for the approximate answer to be 
useful, we must know how close to the true answer it is. Since the whole 
reason that we went after an approximate answer in the first place was that 
we had no method for finding the exact answer, this whole discussion raises 
tricky questions. How do we get our hands on the error, and how do we 
estimate it? Any time decimal approximations are used, there is a rounding 
off procedure involved. Round-off error is another critical phenomenon that 
we must examine. 


EXAMPLE 5.3.1 Examine the differential equation 
y =ar+y, g(0 ot (S317) 
and consider what happens if the step size h is made too small. 


Solution: Suppose that we are working with a computer having ordinary 
precision—which is eight decimal places. This means that all numerical an- 
swers are rounded to eight places. 

Let h = 10~1!°, a very small step size indeed (but one that could be 
required for work in microtechnology). Let f(z,y) = «+ y. Applying the 
Euler method and computing the first step, we find that the computer yields 


yi = yoth-: f(zo,yo) =14+107' =1. 
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The last equality may seem rather odd—in fact it appears to be false. But 
this is how the computer will reason: It rounds to eight decimal places! The 
same phenomenon will occur with the calculation of yg. In this situation, we 
see therefore that the Euler method will produce a constant solution—namely, 
y = 1. And of course that is not a solution at all. | 


The last example is to be taken quite seriously. It describes what would 
actually happen if you had a canned piece of software to implement Euler’s 
method, and you actually used it on a computer running in the most standard 
and familiar computing environment. If you are not aware of the dangers of 
round-off error, and why such errors occur, then you will be a very confused 
scientist indeed. One way to address the problem is with double precision, 
which gives 16-place decimal accuracy. Another way is to use a symbol ma- 
nipulation program like Mathematica or Maple (in which one can pre-set any 
number of decimal places of accuracy). 

In the present book, we cannot go very deeply into the subject of round- 
off error. What is most feasible for us is to acknowledge that round-off error 
must be dealt with in advance, and we shall assume that we have set up our 
problem so that round-off error is negligible. We shall instead concentrate 
our discussions on discretization error, which is a problem less contingent on 
artifacts of the computing environment and more central to the theory. 

The local discretization error at the nth step is defined to be €, = y(an) — 
Yn. Here y(xp) is the exact value at x, of the solution of the differential 
equation, and y,, is the Euler approximation. In fact we may use Taylor’s 
formula to obtain a useful estimate on this error term. To wit, we may write 


h2 
y(ao +h) = yo th-y' (xo) + ar y"(E), 


for some value of € between x and 29) +h. But we know, from the differential 
equation, that 
y'(xo) = f (Xo, yo) - 


Thus 42 
y(zo + h) = yo + h- f(xo, yo) + > ap LEN 
so that 
h? A h? 
y(x1) = y(to +h) = yo th: f(xo,yo) +o y (=m +> -y ()- 
We may conclude that 
== ye 
5 2 Yy : 


Usually on the interval [zo,2,] we may see on a priori grounds that |y’’| is 
bounded by some constant M. Thus our error estimate takes the form 


ley] < 
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More generally, the same calculation shows that 


lej| < 


Such an estimate shows us directly, for instance, that if we decrease the step 
size from h to h/2 then the accuracy is increased by a factor of 4. 

Unfortunately, in practice, things are not as simple as the last paragraph 
might suggest. For an error is made at each step of the Euler method—or of any 
numerical method—so we must consider the total discretization error. This is 
just the aggregate of all the errors that occur at all steps of the approximation 
process. 

To get a rough estimate of this quantity, we notice that our Euler scheme 
iterates in n steps, from Xo to &p, in increments of length h. So h = [%p—2o]/n 
or n = |%p — %ol/h. If we assume that the errors accumulate without any 
cancellation, then the aggregate error is bounded by 


Mh? Mh 
|E,| <n- 5 = (pn — Zo) — =C ch. 


Here C = (%pn — Xo): M/2, and (ap, — Xo) is of course the length of the interval 
under study. Thus, for this problem, C is a universal constant. We see that, 
for Euler’s method, the total discretization error is bounded by a constant 
times the step size. 


EXAMPLE 5.3.2 Estimate the discretization error, for a step size of 0.2 and 
for a step size of 0.1, for the differential equation with initial data given by 


y=axt+y, y(0) =1. (5.3.2.1) 


Solution: In order to get the maximum information about the error, we are 
going to proceed in a somewhat artificial fashion. Namely, we shall use the 
fact that we can solve the initial value problem explicitly: The solution is given 
by y = 2e” — a — 1. Thus y” = 2e*. Thus, on the interval [0, 1], 


la" |< Beha De. 


Hence 


for each j. The total discretization error is then bounded (since we calculate 
this error by summing about 1/h terms) by 


|En| < eh. (5.3.2.2) 


Referring to Table 5.2.1.2 in Section 5.2 for incrementing by h = 0.2, 
we see that the total discretization error at « = 1 is actually equal to 0.46 
(rounded to two decimal places). (We calculate this error from the table by 
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subtracting y,, from the exact solution.) The error bound given by (5.3.2.2) 
is e- (0.2) = 0.54. Of course the actual error is less than this somewhat crude 
bound. With h = 0.1, the actual error from Table 5.2.1.2 is 0.25 while the 
error bound is e- (0.1) = 0.27. a 


REMARK 5.3.3 In practice, we shall not be able to explicitly solve the differ- 
ential equation being studied. That is, after all, why we are using numerical 
techniques and a computer. So how do we, in practice, determine when h is 
small enough to achieve the accuracy we desire? A rough-and-ready method, 
that is used commonly in the field, is this: Do the calculation for a given h, 
then for h/2, then for h/4, and so forth. When the distance between two suc- 
cessive calculations is within the desired tolerance for the problem, then it is 
quite likely that they both are also within the desired tolerance of the exact 
solution. 


REMARK 5.3.4 How do we, in practice, check to see whether h is too small, 
and thus causing round-off error? One commonly used technique is to re-do 
the calculation in double precision (on a computer using one of the standard 
software packages, this would mean 16-place decimal accuracy instead of the 
usual 8-place accuracy). If the answer seems to change substantially, then some 
round-off error is probably present in the regular precision (8-place accuracy) 
calculation. 


a 


Exercises 


In each of Exercises 1-5, use the exact solution, together with step sizes h = 0.2 and 
0.1, to estimate the total discretization error that occurs with the Euler method at 
ra 1: 


y =2x+2y, y(0)=1 
y =1/y, y(0)=1 

y' =e", (0)=0 

y =y-sinz, y(0)=—1l 
y =(e+y-1), y(0)=0 


oe Ne 


6. Consider the problem y’ = sin3rzx with y(0) = 0. Determine the exact 
solution and sketch the graph on the interval 0 < x < 1. Use the Euler 
method with h = 0.2 and h = 0.1 and sketch those results on the same set 
of axes. Compare and discuss. Now use the results of the present section 
of the text to determine a step size sufficient to guarantee a total error of 
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0.01 at = 1. Apply the Euler method with this step size, and compare 
with the exact solution. Why is this step size necessarily so small? 


[2 ——— ee 


5.4 An Improved Euler Method 


We improve the Euler method by following the logical scheme that we em- 
ployed when learning numerical methods of integration in calculus class. 
Namely, our first method of numerical integration was to approximate a de- 
sired integral by a sum of areas of rectangles. (This is analogous to the Euler 
method, where we approximate the integrand by the constant value at its left 
endpoint.) Next, in integration theory, we improved our calculations by ap- 
proximating by a sum of areas of trapezoids. That amounts to averaging the 
values at the two endpoints. This is the philosophy that we now employ. 
Recall that our old equation is 


a1 
n=w-+ f f(a, y) dz. 
xo 


Our idea for Euler’s method was to replace the integrand by f(2xo, yo). This 
generated the iterative scheme of the last section. Now we propose to instead 
replace the integrand with [f (vo, yo) + f(%1, y(1))|/2. Thus we find that 


h 
ya = yo + S/F (20, Yo) + F(@1, y(a1))] (5.4.1) 
The trouble with this proposed equation is that y(a1) is unknown—just 
because we do not know the exact solution y. What we can do instead is to 
replace y(21) by its approximate value as found by the Euler method. Denote 
this new value by z1 = yo +h: f(xo0, yo). Then (5.4.1) becomes 


yl = Yo “ ‘[f(xo, yo) + f(a1, 21) - 


The reader should pause to verify that each quantity on the right-hand side 
can be calculated from information that we have—without knowledge of the 
exact solution of the differential equation. More generally, our iterative scheme 
is 


h 
Wr =Uits- [f(xj,y5) + f(ej41, 241) 


where 
zy = yj th- f(xj, ys) 
and j = 0,1,2,.... 
This new method, usually called the improved Euler method or Hewn’s 
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Corrected slope f(x ,, Z ,) 


Error at first step 


FIGURE 5.3 
The improved Euler method. 


method, first predicts and then corrects an estimate for y;. It is an example of a 
class of numerical techniques called predictor-corrector methods. It is possible, 
using subtle Taylor series arguments, to show that the local discretization 
error is 13 
Coe 
for some value of € between x and x,. Thus, in particular, the total dis- 
cretization error is proportional to h? (instead of h, as before), so we expect 
more accuracy for the same step size. Figure 5.3 gives a way to visualize the 
improved Euler method. First, the point at (x1, 21) is predicted using the orig- 
inal Euler method, then this point is used to estimate the slope of the solution 
curve at x1. This result is then averaged with the original slope estimate at 
(29, yo) to make a better prediction of the solution—namely, (1, y1). 

We shall continue to examine our old friend 


y=arty, y(0)=1 

and use the value y(1) as a benchmark. 

EXAMPLE 5.4.1 Apply the improved Euler method to the differential equation 
y =«t+y, y(0) =1 (5.4.1.1) 


with step size 0.2 and gauge the improvement in accuracy over the ordinary 
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Euler method used in Examples 5.2.1 and 5.3.2. 


Tabulated values for exact and numerical 
solutions to (5.4.1.1) with h = 0.2 
using the improved Euler method 


In Yn 

0.0 1.00000 
0.2 1.24000 
0.4 1.57680 
0.6 2.03170 
0.8 2.63067 
1.0 3.40542 


Solution: We see (remembering that f(x,y) = x+y) that 


Ze = ye + 0.2- f (ee, ye) = yx + 0.2+ (ee + ye) 


and 


Yeo = yk + 0.1 (ee + ye) + (@e41 + 2e41)]- 


Table 5.4.1.2 


Exact 
1.00000 
1.24281 

1.581.12.25 
2.04424 
2.65108 

3.41.12.256 


En (%) 


0.00 
0.23 
0.43 
0.61 
0.77 
0.91 


213 


We begin the calculation by setting k = 0 and using the initial values 


xo = 0.0000, yo = 1.0000. Thus 


z, = 1.0000 + 0.2 - (0.0000 + 1.0000) = 1.2000 


and 


y = 1.0000 + 0.1 - [(0.0000 + 1.0000) + (0.2 + 1.2000)] = 1.2400. 


We continue this process and obtain the values shown in Table 5.4.1.2. 


We see that the resulting approximate value for y(1) is 3.40542. The ag- 
gregate error is about 1 percent, whereas with the former Euler method it was 


more than 13 percent. This is a substantial improvement. 


Of course a smaller step size results in even more dramatic improvement 
in accuracy. Table 5.4.1.3 displays the results of applying the improved Euler 
method to our differential equation using a step size of h = 0.1. The relative 
error at x = 1.00000 is now about 0.2 percent, which is another order of 
magnitude of improvement in accuracy. We have predicted that halving the 
step size will decrease the aggregate error by a factor of 4. These results bear 


out that prediction. 
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Table 5.4.1.3 
Tabulated values for exact and numerical 
solutions to (5.4.1.1) with h = 0.1 
using the improved Euler method 


In Yn Exact En (%) 
0.0 1.00000 1.00000 0.0 
0.1 1.11000 1.11034 0.0 
0.2 1.24205 1.24281 0.1 
0.3 1.39847 1.39972 0.1 
0.4 1.58180 1.5836 0.1 
0.5 1.79489 1.79744 0.1 
0.6 2.04086 2.04424 0.2 
0.7 2.32315 2.32751 0.2 
0.8 2.64558 2.65108 0.2 
0.9 3.0121.12.2 3.01921 0.2 
1.0 3.42816 3.43656 0.2 


In the next section we shall use a method of subdividing the intervals of 
our step sequence to obtain greater accuracy. This results in the Runge-Kutta 
method. 


Math Nugget 


Carl Runge (1856-1927) was professor of applied mathe- 
matics at Gottingen from 1904 to 1925. He is known for his 
work in complex variable theory, and for his discovery of a 
theorem that foreshadowed the famous Thue-Siegel—Roth 
theorem in diophantine equations. He also taught Hilbert 
to ski. M. W. Kutta (1867-1944), another German applied 
mathematician, is remembered for his contribution to the 
Kutta—Joukowski theory of airfoil lift in aerodynamics. 
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Runge’s name is also remembered in connection with an 
incident involving the distinguished mathematician Gabor 
Szegd. While returning on a train from a conference, Szeg6 
got into a fistfight with a young man who was sharing his 
compartment (it seems that the point at issue was whether 
the window should remain open or closed). Szegé had been 


a wrestler, and he did quite well in the fisticuffs. Seems 
that the young man came from a wealthy and influential 
family, one that was particularly important in Gottingen. 
So Szegé was brought up on charges. Now Runge’s father- 
in-law was an attorney, and he defended Szeg6—but to no 
avail. Szegé had to leave Gottingen, and ultimately ended 
up at Stanford. 


a 


Exercises 


For each of Exercises 1-5, use the improved Euler method with h = 0.1,0.05, and 
0.01 to estimate the solution at « = 1. Compare your results to the exact solution 
and the results obtained with the original Euler method in Exercises 1-5 of Section 
5.2. 


y =2a+2y, y(0)=1 
y=1/y, y(0)=1 
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5.56 The Runge-Kutta Method 


Just as the trapezoid rule provides an improvement over the rectangular 
method for approximating integrals, so Simpson’s rule gives an even better 
means for approximating integrals. With Simpson’s rule we approximate not 
by rectangles or trapezoids but by parabolas. 

Check your calculus book (for instance, [STE, p. 421] to review how Simp- 
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son’s rule works. When we apply it to the integral of f, we find that 


[fle u)ae = Zs e0.ye) + 4f(erja.ylerj2)) + flerveer)))- 65-1) 


Here 21/2 = % + h/2, the midpoint of ao and 2}. 

The Runge-Kutta method proceeds analogously. We cannot provide all 
the rigorous details of the derivation of the fourth-order Runge-Kutta method. 
We instead give an intuitive development. 

Just as we did in obtaining our earlier numerical algorithms, we must 
now estimate both y;/2 and y;. The first estimate of y;/2 comes from Euler’s 
method. Thus 

my 
Yi/2 = Yo + a 
Here 
mi =h- f(xo, yo). 


(The factor of 1/2 here comes from the step size from a9 to %1/.) To correct 
the estimate of y;/2, we calculate it again in this manner: 
m2 


Yi/2 = Yo + 5 


where 
m2 = h- f(ao + h/2, yo + m1/2) x 


Now, to predict y1, we use the expression for y;/2 and the Euler method: 


™3 
’ 


Yi = 41/2 + 5 


where m3 = h- f (ap + h/2, yo + m2/2). 
Finally, let m4 = h- f(xo +h, yo + m3). The Runge-Kutta scheme is then 
obtained by substituting each of these estimates into (5.5.1) to yield 


1 
Yt = Yor radi + 2m + 2m3+ma4). 
Just as in our earlier work, this algorithm can be applied to any number of 
mesh points in a natural way. At each step of the iteration, we first compute 
the four numbers m1,m2,m3,ma given by 


m, = A- f (re, ye) 


h 
ee ee 
h =) 


m3 = nes (a =, Uk 


2 
ma = Ah: flap th, ye +ms). 
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Then yz41 is given by 


1 
Yr+i = Yr + 5m + 2m2 + 2m3 + ma) : 
This new analytic paradigm, the Runge-Kutta technique, is capable of 
giving extremely accurate results without the need for taking very small values 
of h (thus making the work computationally expensive). The local truncation 


error is 
_ yO) ne 
180 , 
where € is a point between xp and x,. The total truncation error is thus of 
the order of magnitude of h*. 
Now let us apply our new methodology—by far the best one yet—to our 
benchmark problem 


ek = 


y=a+y, y(0)=1. 


As usual, we shall calculate y(1) as a test. 

EXAMPLE 5.5.2 Apply the Runge-Kutta method to the differential equation 
y=at+y, y(0) = 1. (5.5.2.1) 

Take h = 1, so that the process has only a single step. 


Solution: We determine that 


m, = 1-(0+1)=1 

m2 = 1-(0+0.5+1+4+0.5) =2 
mgs = 1-(0+05+14+1)=25 
me = 1-(0+1414+2.5)=45. 


Thus 1 
Yi =1+5(1+4+5 44.5) = 3.417. 


Observe that this approximate solution is even better than that obtained with 
the improved Euler method for h = 0.2 (with five steps). And the amount of 
computation involved was absolutely minimal. 

Table 5.5.2.2 shows the result of applying Runge-Kutta to our differential 
equation with h = 0.2. Notice that our approximate value for y(1) is 3.436596, 
which agrees with the exact value to four decimal places. The relative error is 
less than 0.002 percent. 

If we cut the step size in half—to 0.1, then the accuracy is increased 
dramatically—see Table 5.5.2.3. Now the relative error is less than 0.0002 
percent. 
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Table 5.5.2.2 
Tabulated values for exact and numerical 
solutions to (5.5.2.1) with h = 0.2 
using the Runge-Kutta method 


Ln Yn Exact E, (%) 
0.0 1.00000 1.00000 0.00000 
0.2 1.24280 1.24281 0.00044 
0.4 1.58364 1.58365 0.00085 
0.6 2.04421 2.04424 0.00125 
0.8 2.65104 2.65108 0.00152 
1.0 3.436596 3.43656 0.00179 


Table 5.5.2.3 
Tabulated values for exact and numerical 
solutions to (2) with h = 0.1 
using the Runge-Kutta method 


Xn Yn Exact Ey, (%) 
0.0 1.00000 1.00000 0.0 

0.1 1.1103417 1.1103418 0.00002 
0.2 1.24281 1.24281 0.00003 
0.3 1.39972 1.39972 0.00004 
0.4 1.583652 1.58365 0.00006 
0.5 1.79744 1.79744 0.00007 
0.6 2.04424 2.04424 0.00008 
0.7 2.32750 2.32751 0.00009 
0.8 2.65108 2.65108 0.00010 
0.9 3.01920 3.01921 0.00011 
1.0 3.43656 3.43656 0.00012 


a 


Exercises 


For each of Exercises 1-5, use the Runge-Kutta method with h = 0.1,0.05, and 
h = 0.01, to estimate the solution at x = 1. Compare your results to the exact 
solution and the results obtained with both the Euler method (Exercises 1-5 of 
Section 5.2) and the improved Euler method (Exercises 1-5 of Section 5.4). 
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y =2e+2y, y(0)=1 
y =1/y, y(0)=1 
e¥, y(0)=0 


US RN 
—s 
II 


6. Use the Runge-Kutta method with h = 0.2 to find an approximate solu- 
tion of the following initial value problem. 
20 


ty” — 3ty’+3y=1, y(1)=0, y'(1) =0. 


Determine the exact solution and compare your results. Does the differ- 
ential equation possess a solution at t = 0? How might the Runge-Kutta 
method be employed to compute the solution at 0? 

7. Use your favorite scientific computing language—BASIC or Fortran or 
C or APL—to write a routine to implement the Runge-Kutta method. 
Apply the program to the initial value problem 


y =yta, y(0)=1. 


Now solve the same initial value problem using your symbol manipulation 
software (such as Maple or Mathematica). You will probably find that 
the symbol manipulation software is faster and has a higher degree of 
accuracy. Can you speculate why this is so? 


Now apply both methodologies to the initial value problem 


y =y—siny+czy, y(0)=1. 


Supply similar comparisons and commentary. 


5.6 A Constant Perturbation Method for Linear, 
Second-Order Equations 


The philosophy that we have employed in each of the numerical techniques 
of this chapter is a very simple one, and it parallels the philosophy that was 
used to develop numerical techniques of integration in calculus. Namely, we 
approximate the differential equation (more precisely, we approximate the 
coefficients of the differential equation) by polynomials. In the most rudimen- 
tary technique—Euler’s method—we approximate by constant functions. In 
the improved Euler method we approximate by linear functions. And in the 
last, most sophisticated technique (the Runge-Kutta method) we approximate 
by quadratic functions (or parabolas). 

This methodology just described, while straightforward and logical, has 
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its limitations. First of all, it is not adapted to the particular differential 
equation that is being studied. It is a universal technique. Second, it only 
works on very small intervals where the approximation (by constant, linear, 
or quadratic functions) is good. Third, it is not very flexible. 

In this section, we shall introduce a technique that is much more adapt- 
able. It will actually be different in its implementation for each differential 
equation to which we apply it. Instead of approximating by a universal object 
like a linear function, we shall approximate by a constant-coefficient differen- 
tial equation. The setup is as follows. 

Suppose that we are given an initial value problem 


y" +a(a)y’ + W(a)y=c(x), 2 € [a,b], y(a)= yo, y’(@) = y- 


Here a(x), b(x),c(a) are given functions. Thus our differential equation has 
variable coefficients. As is customary and familiar, we introduce a partition 


A=% <4 <+++Up_y < Up =D. 


Now, on each interval I; = [x;~1,2,;], we approximate each of the coefficient 
functions by a constant: 


a(x) < a;, b(x) o b;, C(x) > CG; . 


A convenient way (but by no means the only way) of choosing these constants 
is 


ee a(xj—-1) + a(a;) 


a; = a an 4 
j = dlticr) + b(25) 
ef) —_ 9 ’ 
~ _ ¢(aj-1) + e(2;) 
Cj = = ~@- == ‘ 


Thus, on each interval J;, we solve the approximating differential equation 
y +ayy +by= Gr LE [vj-1, 25] . (5.6.1) 


This is, of course, an equation that can be solved by hand. Let us assume for 
convenience that a; # Ab; for each j. With this assumption, we let 


—a; + \/@ — 4b, 


wo = 2 
—a; — (a3 — 4b; 
wv = —_—__——__... 
2 


Then it is easy to see that the general solution of the associated homogeneous 
equation 2 
yf + ayy’ + by =0 
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is 
(2) = Ager ea) ae Bye? “(> 9-1) : 
(Note that the use of (x — x;_1) instead of just x only contributes a multi- 
plicative constant, which is harmless.) Here, as usual, A; and B; are arbitrary 
constants. 
A particular solution of equation (5.6.1) is 


ae 
Yop FT SH UF 
P b J 


provided that b; # 0. In fact, in what follows, we shall assume that b; # 0, 
that a; € 0, and further that aj x 4b;. The interested reader may work out 
the details for the other cases, but the case that we treat is both typical and 
indicative. 

Thus we find that the general solution of (5.6.1) is given by taking a linear 
combination of the particular solution we have found and the general solution 
to the associated homogeneous equation: 


y(x) = Ajae” @-8-) a Bene ee i. ii; 
The first derivative of this equation is 
y (x) = Ajy_wte’* @-2-1) ale Bega? Se) ; 


The values of A;—; and B;_; on the jth interval [7;_1, 75] are then determined 
by suitable initial conditions 9(«;-1) = 971, 9'(aj-1) = 7). Thus we have 


j—1 


yl = Aj-1 + By-1 + uj 


and 
Lon 


yo = wt Aj-4 + w By-1 ‘ 


It follows that 


1 set = ~j— 
age ee es) 


Aj-1 = 


and 
1 


Bee tu; — yt). 
J wt —w ) 


= (wry? — why — 9 

Now we need to explain how the algorithm advances from step to step (as 
j increases, beginning at j = 1). In the first interval I; = [ao, 71], we construct 
Go, bo, Go and also w*,w, uo. The solution, in particular the values of Ag and 
Bo, are determined by the initial conditions y(a) = yo, y’(a) = yi. The value 
of this unique solution, and the value of its first derivative, at the point 2), 
are taken to be the initial conditions when we next perform our algorithm on 
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the interval Iz = [x,, x2]. This will determine a unique solution on the second 
interval Ip. 

In general, we take the unique solution produced by our algorithm on 
the interval J;_; and use the value of it and its first derivative to give initial 
conditions for the initial value problem on J;. 

The advantage of this new methodology is that the approximations come 
directly from the coefficients a(x), b(x), c(a) of the original equation. The user 
can control the size of the deviations a(x)—4a,, b(x) bs c(x)—¢; by adjusting 
the size of the intervals in the partition. One upshot is that the range of values 
for h under which we get an acceptable approximation to the desired solution 
can, in principle, be quite large. 

The reader may wish to practice with this new method by applying it to 
the initial value problem 


y” —Aay! + (42? + a? —2)y+ are” 0, y=1lyw=8 


for x € [0,5]. See how the behavior of the approximation changes for different 
values of a € [1,25] and ( € [0, 25]. 


Te 


Problems for Review and Discovery 


A. Drill Exercises 


1. For each of these exercises, use the Euler method with h = 0.1, 0.05, and 
0.01 to estimate the solution at x = 1. In each case, compare your results 
to the exact solution and discuss how well (or poorly) the Euler method 
has worked. 


(a) y =x—-2y, y(0)=2 
(b) y=1/y*, y(0)=1 
(c) y’=e%, y(0)=1 
(d) y=y+tcosx, y(0)=—-2 
(e) y =(2-y+1)’, y(0)=0 
(f) y =a y(0)=1 
2. In each of these exercises, use the exact solution, together with step sizes 
h = 0.2 and 0.1, to estimate the total discretization error that occurs 
with the Euler method at x = 1. 
(a) y =2rt+y, y(0)=0 
(b) y =z (0) =2 
(c) y’=e%, y(0)=0 
y 
y 


(d) y=ytocosy, y(0)=-2 
(e) y =(e@-y+1)’, y(0)=0 
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(f) y=ax-3y, y(0)=1 

3. In each of these exercises, use the improved Euler method with h = 
0.1, 0.05, and 0.01 to estimate the solution at x = 1. Compare your 
results to the exact solution and the results obtained with the original 
Euler method in Exercise 1 above. 


(a) yo =x-2y, y(0)=2 
(b) y’=1/y?, y(0)=1 
(c) y=e%, y(O)=1 
(d) y =y+t+cosz, y(0)=—2 
(e) y =(e@-y+1)’, (0) =0 
(f) Yy=s5, y(0)=1 

4. For each of these exercises, use the Runge-Kutta method with h = 
0.1, 0.05, and h = 0.01 to estimate the solution at x = 1. Compare your 
results to the exact solution and the results obtained with both the Euler 
method (Exercise 1) and the improved Euler method (Exercise 3). 
(a) y’=x-2y, y(0)=2 
(b) y'=1/y?, y(0)=1 
(c) y=e", y(0)=1 
(d) y’=y+t+cosz, y(0)=—2 
(e) y =(e@-yt+1)’, y(0)=0 
(f) y =a, y(0)=1 

B. Challenge Problems 


1. Consider the initial value problem 


Apply the Euler method at x = 2 with step size h and show that the 
resulting approximation is 
a\2/h 
Ar{1l--= “ 
(-3) 


2. Apply the improved Euler method to the initial value problem 


yays UO t: 


Use step sizes h = 1,0.1, 0.01, 0.001, 0.0001 to get better and better ap- 
proximations to Euler’s constant e. What number of decimal places of 
accuracy do you obtain? 

3. Use the Runge-Kutta method with step size h = 0.1 to approximate the 
solution to 

y =sin(4y) — 2a, y(0) = 0 

at the points 0,0.1,0.2,...,1.9,2.0. Use this numerical data to make a 
rough sketch of the solution y(x) on the interval [0, 2]. 
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4. Use the Runge-Kutta method with step size h = 0.05 to approximate 
the solution to 
y' = 4sin(y — 3x), y(0) =1 


at the points 0, 0.05, 0.1, 0.15, 0.2,...,0.95, 1. Use this numerical data to 
make a rough sketch of the solution y(x) on the interval [0, 1]. 


C. Problems for Discussion and Exploration 


1. Devise an initial value problem that will enable you to get a numerical 
approximation to the value of the number z. Use this device to compute 
am to four decimal places of accuracy. 


2. The logistic equation 


oe =ap— bp” p(0) = po 

is often used as a simple model of population growth. Take a = 1,b = 
2,po = 50, and step size 0.1. Use Euler’s method to approximate the 
value of the solution at « = 2. Now use the improved Euler method. 
What increase in accuracy do you obtain? Conclude by applying the 
Runge-Kutta method. What increase in accuracy do you see now? 


3. Replace the logistic equation in Exercise 2 with the more general equation 


dp 

— =ap-— bp" 0) = 

HE oP ~ bP, (0) = po 

for some parameter r > 1. Take a = 2,b = 1,p0 = 1.5, and explore the 
effect of varying the parameter r. Conduct this exploration using Euler’s 
method with step size h = 0.1. Now use the improved Euler method and 
see how things change. 


4. It is standard to model the velocity of a falling body with the initial value 


problem 
du 
m—=mg—kv, v(0)=v0, (*) 
dt 
where g is the acceleration due to gravity, —kv is air resistance, and m is 
the mass of the body. Explain why this is a correct physical model, just 
using Newton’s laws from elementary physics. Of course this equation 


may be solved explicitly. 


In some settings it is appropriate to replace the air resistance terms 
with —kv" for some r > 1. Then the initial value problem becomes 


m—=mg—kv', v(0)0=v0. (x) 


Explore the effect of changing the parameter r by taking m = 5, g = 9.81, 
k = 4, and vo = 0. Use the improved Euler method with step size h = 0.1 
on the interval [0, 10]. Now use the Runge-Kutta method and see whether 
you can learn more. 
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5. In the study of nonisothermal flow of a Newtonian fluid between parallel 
plates, one encounters the ordinary differential equation 


2 
<a tae’ =0, xz>o0. 


There is a sequence of changes of variable that will transform this equa- 


tion to j 
Ws Se has i 3 | i 5 2 
Tu u(S+l)o (utd) o%, 


See whether you can discover the changes of variable that will effect this 
transformation. 


Now use the Runge-Kutta method to approximate v(2) if v(2.1) = 0.1. 


Taylor & Francis 
Taylor & Francis Group 


http://taylorandfrancis.com 
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Fourier Series: Basic Concepts 


The idea of Fourier series 


Calculating a Fourier series 


Convergence of Fourier series 


Odd and even functions 


e Fourier series on arbitrary intervals 


Orthogonality 


a 


6.1 Fourier Coefficients 


Trigonometric and Fourier series constitute one of the oldest parts of analysis. 
They arose, for instance, in classical studies of the heat and wave equations. 
Today they play a central role in the study of sound, heat conduction, electro- 
magnetic waves, mechanical vibrations, signal processing, and image analysis 
and compression. Whereas power series (see Chapter 3) can only be used to 
represent very special functions (most functions, even smooth ones, do not 
have convergent power series), Fourier series can be used to represent very 
broad classes of functions. 
For us, a trigonometric series or Fourier series is one of the form 


1 


f(x) = 500 + S- @ cos nx + by, sin ne) . (6.1.1) 


n=1 
We shall be concerned with three main questions: 


1. Given a function f, how do we calculate the coefficients ay, b,? 


2. Once the series for f has been calculated, can we determine that it 
converges, and that it converges to f? 


3. How can we use Fourier series to solve a differential equation? 
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We begin our study with some classical calculations that were first per- 
formed by Euler (1707-1783). It is convenient to assume that our function f 
is defined on the interval [—7,7] = {t € R: —n < a < mr}. We shall tem- 
porarily make the important assumption that the trigonometric series (6.1.1) 
for f converges uniformly. While this turns out to be true for a large class of 
functions (continuously differentiable functions, for example), for now this is 
merely a convenience so that our calculations are justified. 

We apply the integral to both sides of (6.1.1). The result is 


 fl@de 


wT 1 co 
- — [Geo Xo (ar cosne + basinne) ) ae 
m4 co Tw co Tv 
= / saode + >> f ancosnade + >> bn sinna dz. 
n=1"—F n=1" 7h 


TT 


| 


The change in order of summation and integration is justified by the uniform 
convergence of the series (see [KRA2, page 202, ff.]). 
Now each of cos na and sin nx integrates to 0. The result is that 


ao = — 7 f(a) daz. 


T 


In effect, then, ag is (twice) the average of f over the interval [—7, 7]. 
To calculate a; for 7 > 1, we multiply the formula (6.1.1) by cos jx and 
then integrate as before. The result is 


us wT 1 co 
f(x) cos jxdx = , {5 + Xe («0 cos nx + by sin ne) bos ja dx 


= n=1 


wT 1 co wT 
=f saocosjedr+ > | Gn COSNZ Cos jx dx 
Tv 2 n=1 ex | 


+> / bp, sinnx cos jx dx . (6.1.2) 
nal 


Now the first integral on the right vanishes, as we have already noted. Further 
recall that 


COS NX COS JL = (costu + j)x + cos(n — ie) 


NlR 


and 


NIlR 


sINNxX COS 7X = 


(sin(n + j)x + sin(n — a) 
It follows immediately that 


Tv 
i cos nz cos jx dx = 0 when n 4 j 


= 
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and - 
/ sin nz cos jz dx = 0 for all n,7. 


Thus our formula (6.1.2) reduces to 


TT 


Tv 
f(x) cos ja dx = a; cosjxcos jx dx. 
j 
= 


ie 


We may use our formula above for the product of cosines to integrate the 
right-hand side. The result is 


Tv 
f(x) cos jadz = a;-4 
—w 


or 


1 as 
a;=— f(x) cos jada. 
T Jan 
A similar calculation shows that 
1 7 
b= - f(a) sin jx dz. 
T Jin 
In summary, we now have formulas for calculating all the a;’s and 6,’s: 
1” : ; 
aj =~ f(x)cosjadx , j=0,1,... 
and Loft 
b= — f(x)singadx, j=1,2,.... 
T Jan 


EXAMPLE 6.1.3 Find the Fourier series of the function 
FB) Ha; —T7<a<T. 


Solution: 
Of course 


sa ie ge 
a= = | ade = — + 
T nm 2 


=< 


For 7 > 1, we calculate a; as follows: 
1 vig 
a; = ~ | xcos jx dx 
as = 
(parts) 1 ( sin jx 
= _ «L——_ 
T J 


) 
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Similarly, we calculate the b;: 


1 Tv 
b= =f xsin jx dx 


(parts) +(0—2 — cos 8 ja)" 
= ¢———— 


ic) ee ax) 

1 J 

_ 1 ee sin ja |" 

= = @ |. 
2. Bes 


1)s+4 
Now that all the coefficients have been calculated, we may summarize the 
result as 


in 2 : 
b= f(e) =2(sine - sin2¢  sin3x oy). 


5 3 
: 


It is sometimes convenient, in the study of Fourier series, to think of 
our functions as defined on the entire real line. We extend a function that 
is initially given on the interval [—7,7] to the entire line using the idea of 
periodicity. The sine function and cosine function are periodic in the sense 
that sin(a + 27) = sinw and cos(a + 277) = cosa. We say that sine and cosine 
are periodic with period 27. Thus it is natural, if we are given a function f on 
[—1,7), to define f(a + 2m) = f(x), f(v+ 2-2) = f(x), f(x — 27) = f(a), 
etc.! 

Figure 6.1 exhibits the periodic extension of the function f(x) = x on 
[—7, 7) to the real line. 

Figure 6.2 shows the first four summands of the Fourier series for f(x) = a. 
The finest dashes show the curve y = 2sinz, the next finest is — sin 2a, the 
next is (2/3) sin 3a, and the coarsest is —(1/2) sin 4a. 

Figure 6.3 shows the sum of the first four terms of the Fourier series and 
also of the first six terms, as compared to f(a) = x. Figure 6.4 shows the sum 
of the first eight terms of the Fourier series and also of the first ten terms, as 
compared to f(x) = a. 


EXAMPLE 6.1.4 Calculate the Fourier series of the function 


Ve 0 if —-7t<2<0 
g(x) = mn if O<a<m. 


Solution: 


1Notice that we take the original function f to be defined on [—7, 7) rather than [—7, 7] 
to avoid any ambiguity at the endpoints. 
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FIGURE 6.1 
Periodic extension of f(a) = x. 


FIGURE 6.2 
The first four summands for f(a#) = x. 
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~o 
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2. 


NKR, 


=z 


MKR. 


FIGURE 6.3 


The sum of four terms and of six terms of the Fourier series of f(a) = x 


P 
P 
OKR 
OKR 
0 
0 
NKR 
NKR 
N 
N 
MKR tic 
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FIGURE 6.4 


The sum of eight terms and of ten terms of the Fourier series of f(a”) = x 
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FIGURE 6.5 
The sum of four and of six terms of the Fourier series of g. 


Following our formulas, we calculate 


i, fe 1 f° oi 
a == | g(a) de == | ode+— f rdt=T. 
—t —T 0 


1 /* sin na |" 
adn = — 7 cosnx dx = 
0 


Tv nr 


0) 


i; 1 : 
n= = | msinnz dz = —(1—cosnm) = —(1—(-1)"). 
0 My . 


T 


Another way to write this last calculation is 


2 
boy =0, bait = 5 


In sum, the Fourier expansion for g is 


2 


sin3x2  sindx Po 
3 5 , 


g(x) = ae (sine + 


Figure 6.5 shows the fourth and sixth partial sums, compared against the 
function g(x). Figure 6.6 shows the eighth and tenth partial sums, compared 
against the function g(z). | 


EXAMPLE 6.1.5 Find the Fourier series of the function given by 
=> & =< a< 0 

h(a) = 
5 if O<a<7. 


Solution: 
This is the same function as in the last example, with 7/2 subtracted. Thus 
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FIGURE 6.6 
The sum of eight terms and of ten terms of the Fourier series of g. 


the Fourier series may be obtained by subtracting 7/2 from the Fourier series 
that we obtained in that example. The result is 


‘ nb 
he) =2 (sine + SS 4) . 


5 


The graph of this function, suitably periodized, is shown in Figure 6.7. Ml 


EXAMPLE 6.1.6 Calculate the Fourier series of the function 


if —-7<a<0 
if O<a<7. 


Solution: 

This function is simply the function h from Example 6.1.5 minus half the 
function f from Example 6.1.3. In other words, k(a) = h(a) — [1/2] f(x). Thus 
we may obtain the requested Fourier series by subtracting half the series from 
Example 6.1.3 from the series in Example 6.1.5. The result is 


f(a) = 2 (sine + sin 3x te sin 5x +) 


3 5) 
; sin2x sin3z 
- (sine - 5 + 3 -+-) 
es sin2x%  sin3a 
= snxz+ 5 + 3 
S sin nx 
= i. 


The graph of this series is the sawtooth wave shown in Figure 6.8. | 
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FIGURE 6.7 
Graph of the function f in Example 6.1.5. 


FIGURE 6.8 
The sawtooth wave. 


236 CHAPTER 6: FOURIER SERIES: BASIC CONCEPTS 


TT 


Exercises 


1. Find the Fourier series of the function 


e)= 
F@) 0 if teaK<n 
2 
2. Find the Fourier series for the function 
0 if -7<ax2<0 
f(x) = 1 if O<x<§F 
0 if a <@Sn 


3. Find the Fourier series of the function 


iay={ § if —-7<a2<0 

sing if O<a<7. 

4. Solve Exercise 3 with sin x replaced by cos z. 

5. Find the Fourier series for each of these functions. Pay special attention 
to the reasoning used to establish your conclusions; consider alternative 
lines of thought. 


(a) f(x@)=7, -t<aKdca 

(b) f(z) =sinz, -t<a<a7 

(c) f(a) =cosxz, -at<au<a 

(d) f(z) =a7+sinzr+cosx, —-rt<a<t7 


Solve Exercises 6 and 7 by using the methods of Examples 6.1.5 and 
6.1.6, without actually calculating the Fourier coefficients. 


6. Find the Fourier series for the function given by 
(a) 
—a if -7<2<0 
f= { 5 if O<a<a7 


(b) 
-1 if -rw<a<0 
lay={ 3 if O<a<a 


(c) 
_f -—$ if -rt<a<0 
fa) ={ z if O<a<n 


(d) 


(e) 


1 if -r7<a<0 
seay={ if O<a<a 
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7. Without using the theory of Fourier series at all, show graphically that 
the sawtooth wave of Figure 6.1 can be represented as the sum of a 
sawtooth wave of period 7 and a square wave of period 7. 


rr 


6.2. Some Remarks about Convergence 


The study of convergence of Fourier series is both deep and subtle. It would 
take us far afield to consider this matter in any detail. In the present section 
we shall very briefly describe a few of the basic results, but we shall not prove 
them. See [KRA3] for a more thoroughgoing discussion of these matters. 

Our basic pointwise convergence result for Fourier series, which finds its 
genesis in work of Dirichlet (1805-1859), is this: 


Definition 6.2.1 Let f be a function on [—7, 7]. We say that f is piecewise 
smooth if the graph of f consists of finitely many continuously differentiable 
curves, and furthermore that the one-sided derivatives exist at each of the 
endpoints {p1,...,px} of the definition of the curves, in the sense that 


km 2 Pit) = FCP) aaa im £2 +2) = Flos) 
h—ot h h—0- h 


exist. Further, we require that f’ extend continuously to [p;,pj;+1] for each 
j=1,...,k—1. See Figure 6.9. 


Theorem 6.2.2 Let f be a function on [—7,7]| which is 
piecewise smooth and overall continuous. Then the Fourier 


series of f converges at each point c of [—1,7] to f(c). 


Let f be a function on the interval [—7,7]. We say that f has a simple 
discontinuity (or a discontinuity of the first kind) at the point c € (—7,7) if 
the limits lim f(x) and lim,_,.+ f(x) exist and 


Jim f(x) # lim, f(x). 


ed Oe 


The reader should understand that a simple discontinuity is in contradistinc- 
tion to the other kind of discontinuity. That is to say, f has a discontinuity of 
the second kind at c if either lim,_,.- f(x) or lim,_,,+ f(a) does not exist. 


EXAMPLE 6.2.3 The function 
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FIGURE 6.9 
A piecewise smooth function. 


FIGURE 6.10 
A simple discontinuity. 
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1 k 


: 


FIGURE 6.11 
A discontinuity of the second kind. 


has a simple discontinuity at x = 1. It is continuous at all other points of the 
interval [—7, 7]. See Figure 6.10. 


The function F 4 
sn> if «0 
g(a) = { 0 if «=0 
has a discontinuity of the second kind at the origin. See Figure 6.11. a 


Our next result about convergence is a bit more technical to state, but it 
is important in practice, and has historically been very influential. It is due 
to L. Fejér. 


Definition 6.2.4 Let f be a function and let 


1 co 
520 + S- («, cosnax + b, sin na) 


n=1 
be its Fourier series. The Nth partial sum of this series is 


N 


1 
Swn(f)(x) = Phas + > («, cosnz + by, sin ne) ‘ 


n=1 
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The Cesaro mean of the series is 
1 N 
on(f)(x) = Wael » S;(f)(z). 


In other words, the Cesaro means are simply the averages of the partial sums. 


Theorem 6.2.5 Let f be a continuous function on the in- 


terval |—1,7]. Then the Cesaro means on (f) of the Fourier 
series for f converge uniformly to f. 


It is worth noting explicitly that if the Fourier series of a function f 
converges at a point x, then the Cesaro means of the series also converge at 
xp—and to the very same limit. 

A useful companion result is this: 


Theorem 6.2.6 (Fejér) Let f be a piecewise continuous 
function on [—7,7]—meaning that the graph of f consists 
of finitely many continuous curves. Let p be the endpoint of 
one of those curves, and assume that lim,_,,— f(x) = f(p7) 
and lim,_,»+ f(«) = f(pt) both exist (and are possibly un- 
equal). Then the Cesaro means of the Fourier series of f at 
p converge to [f(p~) + f(p*)]/2. 


In fact, with a few more hypotheses, we may make the result even sharper. 
Recall that a function f is monotone increasing if x1 < x2 implies f(x) < 
f (x2). The function is monotone decreasing if x1 < x2 implies f(a1) > f(x). 
If the function is either monotone increasing or monotone decreasing then we 
just call it monotone. Now we have this result of Dirichlet: 


Theorem 6.2.7 (Dirichlet) Let f be a function on 
[—7,7] which is piecewise continuous. Assume that each 
piece of f is monotone. Then the Fourier series of f con- 


verges at each point of continuity c of f in [—1,7] to f(c). 
At other points x it converges to [f(a~) + f(a*)|/2. 


The hypotheses in this theorem are commonly referred to as the Dirichlet 
conditions. 


6.2. 


By linearity, we may extend this last result to functions that are piece- 
wise the difference of two monotone functions. Such functions are said to be 
of bounded variation, and exceed the scope of the present book. See [KRA2] 
for a detailed discussion. The book [TIT] discusses convergence of the Fourier 
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series of such functions. 


a 


Exercises 


1. 


In Exercises 1, 2, 3, 4 of the last section, sketch the graph of the partial 
sum S14 of each Fourier series on the interval —7 < x < z. Also, in each 
case, sketch the graph of the full sum of the Fourier series. Of course use 
the theorems in this section to aid you in your work. 


Find the Fourier series for the periodic function defined by 


—n if -r7<a2<0 
lay={ | if O<a<a 


Sketch the graph of the sum of this series on the interval —5a < x < 5a 
and find what numerical sums are implied by the convergence behavior 
at the points of discontinuity x = 0 and x = 7, etc. 


(a) Show that the Fourier series for the periodic function 


0 if -r<a2<0 
f(a)={ 9. if O<a<n 


is 


yi aes 


fle) = E42 


an ain jar tye sin(2j — 1)a 


(27 — 1) 


im 


a 1 


(b) Sketch the graph of the sum of this series on the interval —5a < a < 


oT. 
(c) Use the series in part (a) with = 0 and x = z to obtain the two 
sums 
ee re ee: rae n 
22." 32d? = 
and 
ee ore: mn 
l+ stata — 
22 3? 4 6 


(d) Derive the second sum in (c) from the first. [Hint: Add 
255, (1/[23])? to both sides.] 
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4. What can you say about the convergence of the Fourier series of the 


function 
-1 if «<0 
f(z) =< 0 if «=0 
1 if «>0 
at the origin? 
5. (a) Find the Fourier series for the periodic function defined by f(x) = e”, 
—n <a <7. (Hint: Recall that cosh x = (e” + e~*)/2.] 
(b) Sketch the graph of the sum of this series on the interval —5a < « < 
or. 


(c) Use the series in (a) to establish the sums 


1 at | a 1) 
pk 2 tanh 7 


and 


5, Ee =F / ry #1) 
sl 2 sinh 7 . 


6. It is usually most convenient to study classes of functions that form linear 
spaces, that is, that are closed under the operations of addition and scalar 
multiplication. Unfortunately, this linearity condition does not hold for 
the class of functions defined on the interval [—7,7] by the Dirichlet 
conditions. Verify this statement by examining the functions 


2: 1 : 
_ fj wsine+2x¢ if «40 
One if x=0 


and 
g(x) = -22. 

7. If f is defined on the interval [—7, 7] and satisfies the Dirichlet conditions 
there, then prove that f(a) = lim ise f(t) and f(at) = lim te f(t) 
exist at every interior point, and also that f(x*) exists at the left endpoint 
and f(x) exists at the right endpoint. [Hint: Each interior point of 
discontinuity is isolated from other such points, in the sense that the 
function is continuous at all nearby points. Also, on each side of such a 
point and near enough to it, the function does not oscillate; it is therefore 
increasing or decreasing.| 


TT 


6.3. Even and Odd Functions: Cosine and Sine Series 


A function f is said to be even if f(—x) = f(a). A function g is said to be 
odd if g(—x) = —g(a). 


6.8. EVEN AND ODD FUNCTIONS 
4 A 


FIGURE 6.12 
An even and an odd function. 
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EXAMPLE 6.3.1 The function f(x) = cosz is even because cos(—) = cos. 


The function g(x) = sin is odd because sin(—x) = — sina. 


The graph of an even function is symmetric about the y-axis. The graph 
of an odd function is skew-symmetric about the y-axis. Refer to Figure 6.12. 


If f is even on the interval [—a, a] then 
; f(a) dx = 2f se)ar 
and if f is odd on the interval [—a, a] then 
: f(x) dx =0. 


Finally, we have the following parity relations 
(even) - (even) = (even) (even) - (odd) = (odd) 
(odd) - (odd) = (even) . 


(6.3.1) 


(6.3.2) 


Now suppose that f is an even function on the interval [—1,7z]. Then 


f(x) + sin nz is odd, and therefore 


1 Tv 
bn = — f(x) sinnz dz = 0. 
T Je 


For the cosine coefficients, we have 


1” 2 [" 
an = — fle) cosne dx = = [ f(x) cos na dz. 
—T 0 


Tv 
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FIGURE 6.13 
Periodic extension of f(x) = |2]. 


Thus the Fourier series for an even function contains only cosine terms. 
By the same token, suppose now that f is an odd function on the interval 
[—a,7]. Then f(x) -cosna is an odd function, and therefore 


dn = f(x) cosna dx =0. 


For the sine coefficients, we have 


1 /[* De ft 
bn = — f(o)sinncde = = | f(x) sin na da. 
—T 0) 


Thus the Fourier series for an odd function contains only sine terms. 


EXAMPLE 6.3.2 Examine the Fourier series of the function f(x) = x from the 
point of view of even/odd. 


Solution: 
The function is odd, so the Fourier series must be a sine series. We calculated 
in Example 6.1.1 that the Fourier series is in fact 


sin2e  sin3zr ) 
eee Creer) We 


x= f(x)= 2(sin Ser + 3 (6.3.2.1) 


The expansion is valid on (—7,7), but not at the endpoints (since the series 
of course sums to 0 at —7 and 7). a 


EXAMPLE 6.3.3 Examine the Fourier series of the function f(x) = || from 
the point of view of even/odd. 


Solution: The function is even, so the Fourier series must be a cosine series. 


In fact we see that 
1 [* 2p 
a == | iz|ae = = | cdx=T. 
T Jom T Jo 
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Also, for n > 1, 
2 [* 2 [* 
An = =| |x| cosna dx = -/ xcosnz dx. 
T JO T Jo 
An integration by parts gives that 


an = —(cosnn -l= 2-1)" — 1). 


TN 
As a result, 
4 
ag; =O0 and agj_1 = FQ7 12 » J=1,2, 
In conclusion, 
Wie au 7 *(cos+ cose cose -) (6.3.3.1) 


The periodic extension of the original function f(x) = |a| on [—7, 7] is 
depicted in Figure 6.13. By Theorem 6.2.7 (see also Theorem 6.2.2), the series 
converges to f at every point of [—7, 7]. a 


It is worth noting that « = |x| on [0,2]. Thus the expansions (6.3.2.1) 
and (6.3.3.1) represent the same function on that interval. Of course (6.3.2.1) 
is the Fourier sine series for x on [0,7] while (6.3.3.1) is the Fourier cosine 
series for x on [0,7]. More generally, if g is any integrable function on [0,7], 
we may take its odd extension g to [—7,7a] and calculate the Fourier series. 
The result will be the Fourier sine series expansion for g on [0,7]. Instead we 
could take the even extension g to [—7,7] and calculate the Fourier series. 
The result will be the Fourier cosine series expansion for g on (0, 7]. 


EXAMPLE 6.3.4 Find the Fourier sine series and the Fourier cosine series ex- 
pansions for the function f(#) = cos on the interval [0, z]. 


Solution: x 
Of course the Fourier series expansion of the odd extension f contains only 
sine terms. Its coefficients will be 


wT n2—1 


a 7" an 
nae f cos xsin nz dr = me (HED") if n>. 
As a result, 


89 


b2j-1 =0 and bo; = m(4j2 — 1) 


, 9=1,2,.... 
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The sine series for f is therefore 


sin 27 
cosx = f ae = ,O<a<mT. 


To obtain the cosine series for f, we consider the even extension f. Of 
course all the 6, will vanish. Also 


SO eee ee 
ae aa t= 0 if nA#él. 


We therefore see, not surprisingly, that the Fourier cosine series for cosine on 
(0, 7] is the single summand cos x. a 


Exercises 
1. Determine whether each of the following functions is even, odd, or neither. 
x sinz, x’ sin2z, e”, (sin x)®, sin x’, cos(x + x), 


l+a 
l-—g 


eta +2°, In 


2. Show that any function f defined on a symmetrically placed interval can 
be written as the sum of an even function and an odd function. [Hint: 
f(x) = gIf(e) + f(—2)] + alf(e) — f-2)1] 

3. Prove properties (6.3.1) and (6.3.2) analytically, by dividing the integral 
and making a suitable change of variables. 


4. Show that the sine series of the constant function f(x) = 7/4 is 


sin3x2  sindx 
5 


for 0 < x < a. What sum is obtained by setting « = 7/2? What is the 
cosine series of this function? 


as ‘ 
—=sing+ 


5. Find the Fourier series for the function of period 27 defined by f(x) = 
cos x/2, —x < a < m. Sketch the graph of the sum of this series on the 
interval —5a <a < 5a. 

6. Find the sine and the cosine series for f(x) = sing. 

7. Find the Fourier series for the 27-periodic function defined on its funda- 
mental period [—7, 7] by 


if —7<2z<0 
if 0 <a<qm 


ye 


fla) = 


VIANA 


—ar+ 


(a) by computing the Fourier coefficients directly; 


6.3. 


10. 
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(b) using the formula 


nm A 
jc] = = ——(cosx+ 
T 


3 ta 


cos3xz  cosdx 


from the text. 
Sketch the graph of the sum of this series (a triangular wave) on the 
interval —5a <a < 5a. 
For the function f(x) = 7 — 2, find 
(a) its Fourier series on the interval —7 < x <7; 
(b) its cosine series on the interval 0 < x <7; 
(c) its sine series on the interval 0 < x < 7. 
Sketch the graph of the sum of each of these series on the interval 
[—57, 5a]. 
if O<a<7/2 
ei u<n 
fay={ 5 if sl eed 


Show that the cosine series for this function is 
7 
f@)=F-=-y ee 
4 om (27 —1)12 


Sketch the graph of the sum of this series on the interval [—5z, 57]. 
(a) Show that the cosine series for x? is 


2 co . 
2. 0 j COS JX 


(b) Find the sine series for x? and use this expansion together with the 
formula (6.3.2.1) to obtain the sum 
3 
re ee 
32 


ae Rs 


(c) Denote by s the sum of the reciprocals of the cubes of the odd positive 
integers: 


S=oatatatet::, 
and show that then 


pe ee ee ee 
p13 * 93 © 33" 48 | (aa 


j=l 


The exact numerical value of this last sum has been a matter of great 
interest since Euler first raised the question in 1736. It is closely related to 
the Riemann hypothesis. Roger Apéry proved, by an extremely ingenious 
argument in 1978, that s is irrational.” 


?The Riemann hypothesis is perhaps the most celebrated open problem in modern mathe- 
matics. Originally formulated as a question about the zero set of a complex analytic function, 
this question has profound implications for number theory and other branches of mathe- 
matics. The recent books [DER], [SAB] discuss the history and substance of the problem. 
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11. (a) Show that the cosine series for x° is 


_ y ar + cos cos(2j — 1)z —1)a 
aaa a a (Q7—-1!’ O<a<m. 
(b) Use the series in (a) to obtain 


4A 


= wv <s | 
(i) 2 G1 Ean = 54 and (ii) Lae 


12. (a) Show that the cosine series for «* is 


(b) Use the series in (a) to find a new derivation of the second sum in 
Exercise 11(b). 


13. The functions sin? x and cos? x are both even. Show, without using any 
calculations, that the identities 


1 1 1 
sin t= 5(1 cos 2a) = 5 5 cos 2x 


and 


cose = 5 (1 cos 22) = 5 | cose 


are actually the Fourier series expansions of these functions. 

14. Find the sine series of the functions in Exercise 13, and verify that these 
expansions satisfy the identity sin? x + cos” x = 1. 

15. Prove the trigonometric identities 


‘ 1 : 1 
sin? 7 = sin - q sin 3x and cos? xz = s+ 7 cos 82 


and show briefly, without calculation, that these are the Fourier series 


expansions of the functions sin? x and cos? x. 


6.4 Fourier Series on Arbitrary Intervals 
We have developed Fourier analysis on the interval [—7, 7] (resp. the interval 


(0, z]) just because it is notationally convenient. In particular, 


/ cosjxcoskadx=0 forj7#k, 


TT 


/ sinjxsnkxdx=0 for j#k, 


=I 
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i sinjxcoskxdz =0 for all j,k, 


and so forth. This fact is special to the interval of length 27. But many physical 
problems take place on an interval of some other length. We must therefore 
be able to adapt our analysis to intervals of any length. This amounts to a 
straightforward change of scale on the horizontal axis. We treat the matter in 
the present section. 

Now we concentrate our attention on an interval of the form [—L, L]. As 
zx runs from —L to L, we shall have a corresponding variable ¢ that runs from 
—7 to 7. We mediate between these two variables using the formulas 


Lt 
f= Ee and (oe 


Thus the function f(x) on [—L, L] is transformed to a new function f(t) = 
f(£t/7) on [—2, 7]. 

If f satisfies the conditions for convergence of the Fourier series, then so 
will f, and vice versa. Thus we may consider the Fourier expansion 


= 1 ess 
f®= 520 a a (« cos nt + bp, sin nt) . 
Here, of course, 


1 [" ~ 1 f[" ~ 
an = — f(t)cosntdt and b, = — f(t) sin nt dt. 
T 


Tv th TT 


Now let us write out these last two formulas and perform the change of 
variables 7 = Lt/m. We find that 


1 TT 
in = — <a Pela) cos nt dt 


= vee cos. de 


nTe 
= Lf 2) 0s ae, 
i ee _ nnn 
=z f ta)sin Fae, 


EXAMPLE 6.4.1 Calculate the Fourier series on the interval [—2, 2] of the func- 
tion 


Likewise, 


0 if -2<2<0 
fe) ={ 4 if” Oe 2, 
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Solution: 
Of course L = 2 so we calculate that 


ak NTx { 1 if n=0 
Qn = - cos — dz = ; 
2 Jo 2 21. 


Also 


This may be rewritten as 
ba; =0 and boj—-1 = 


In conclusion, 


f(c) =f()= 50 + S- (« cos nt + b, sin nt) 


n=1 


1 = —2 ; . TX 
=57 oa ae CP) | 


j=1 


EXAMPLE 6.4.2 Calculate the Fourier series of the function f(a) = cos on 
the interval [—7/2, 7/2]. 


Solution: 
We calculate that 


Also, for n > 1, 


2 n/2 
Qn = =| cos x cos(2nx) dx 
T J—n/2 
Rope 4 
a =| = (costan + 1)a + cos(2n — 1)r) dx 
Tv 1/2 2 
1 (snr 1)z  sin(2n— ve) i 
— = { ———————— + ume— 
Tv 2n+1 2n—-1 —1/2 
2 al 1 4 
= at ees if is dd 
2, a\9n+1 In—1 m(4n? —1) ia 
SSeS i coed Pee 
zs 7 TE i an = i — (4n2 7 D im on 1s even. 
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A similar calculation shows that 


1 n/2 
bn = - | cos x sin 2nx dx 
WT J—n/2 
a ae 
— / = (sin(2n + 1)x + sin(2n — 1)e) dx 
T —n/2 2 
_ Lf =cos(2n+1)z | —cos(2n —1)a ae 
ae: 2n+1 2n-1 sag 
= 0. 


This last comes as no surprise since the cosine function is even. 
As a result, the Fourier series expansion for cosx on the interval 
[(—1/2, 2/2] is 


cosa = f(z) 


= —4 2mn1nrx 
cea 2S 7am) — 1). ae 
oS 4 2(2k — 1)ra 
is », m4Q2k—-12-l ~~ 7/2 = 


Exercises 


1. Calculate the Fourier series for the given function on the given interval. 


(a) f(@)=2, [-1,]] 


(b) g(x) =sinz, [-2,2] 

(c) A(x) =e*, [-3,3] 

(d) f(x)=2, [-1,]] 

(e) g(x) =cos2a, [—1/3, 7/3] 


(f) A(x) = sin(2x — 7/3), [—1,]] 
2. For the functions 
f(@) =-3, -2<a<0 
and 
gz) =3, O<2<2, 


write down the Fourier expansion directly from Example 6.4.1 in the 
text—without any calculation. 
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3. Find the Fourier series for these functions. 


(a) 
es 1l+z2 if -l<2<0 
AAC ae te pee ar et 
(b) 
f(x) = |e, —2<a<2. 
4. Show that 
L Ll . 2jre 
ae? a TZ O0<a<L. 


5. Find the cosine series for the function defined on the interval 0 < x < 1 by 
f(x) = x?—a+1/6. This is a special instance of the Bernoulli polynomials. 


6. Find the cosine series for the function defined by 


2 RO SBS 
fe={ 5 if bees 2. 


7. Expand f(#) = cosa in a Fourier series on the interval -1 <a <1. 


8. Find the cosine series for the function defined by 


Coe 


i 


6.5 Orthogonal Functions 


In the classical Euclidean geometry of 3-space, just as we learn in multivariable 
calculus class, one of the key ideas is that of orthogonality. Let us briefly review 
it now. 

If v = (v1, v2, v3) and w = (wi, we, ws) are vectors in R? then we define 
their dot product, or inner product, or scalar product to be 


V- WH UUW] + VQW2 + V3W3. 
What is the interest of the inner product? There are three answers: 


e Two vectors are perpendicular or orthogonal, written v L w, if and only if 
v-w=0. 


e The length of a vector is given by 


lvl = Vv. 
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e The angle 6 between two vectors v and w is given by 


see ce 


In fact all of the geometry of 3-space is built on these three facts. 

One of the great ideas of twentieth-century mathematics is that many 
other spaces—sometimes abstract spaces, and sometimes infinite-dimensional 
spaces—can be equipped with an inner product that endows that space with 
a useful geometry. That is the idea that we shall explore in this section. 

Let X be a vector space. This means that X is equipped with (i) a notion 
of addition and (ii) a notion of scalar multiplication. These two operations are 
hypothesized to satisfy the expected properties: addition is commutative and 
associative, scalar multiplication is commutative, associative, and distributive, 
and so forth. We say that X is equipped with an inner product (which we now 
denote by (, )) if there is a binary operation 


(e,e):XxX—-R 
satisfying the following properties for u,v,w € X and cE R: 
(v,v) 2 
(b) (v,w) = a v) 
(c) ( 
(d) ( 


v,v) =0 if and only if v = 0; 


av + Gw,u) = a(v,u) + B(w,u) for any vectors u,v,w € V and 
scalars a, (3. 


We shall give some interesting examples of inner products below. Before we 
do, let us note that an inner product as just defined gives rise to a notion of 
length, or a norm. Namely, we define 


Ilvll = V(v,v)- 


By Properties (a) and (c), we see that ||v|| > 0 and ||v|| = 0 if and only if 
v=0. 

In fact the two key properties of the inner product and the norm are 
enunciated in the following proposition: 
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Proposition 6.5.1 Let X bea vector space and (, ) an inner product 
on that space. Let || || be the induced norm. Then 


(1) The Cauchy—Schwarz—Buniakovski Inequality: Ifu,v © X 
then 
ju-v| < |jull - [lvl 


(2) The Triangle Inequality: Ifu,v € X then 


Ju + vl < [lull + [lvl - 


In fact, just as an exercise, we shall derive the Triangle Inequality from 
the Cauchy—Schwarz—Buniakovski Inequality. We have 
lut+v|? = ((utv),(ut+v)) 
= (u,u) + (u,v) + (v,u) + (v,v) 
|lull? + IIv||? + 2(u, v) 
lull? + [Ivil? + 2llul] - [Iv 


F |v)? 


x 


I 
= 


Now taking the square root of both sides completes the argument. We shall 
explore the proof of the Cauchy—Schwarz—Buniakovski Inequality in Exercise 
5. 


EXAMPLE 6.5.2 Let X = C[0,1], the continuous functions on the interval 
(0, 1]. This is certainly a vector space with the usual notions of addition of 
functions and scalar multiplication of functions. We define an inner product 
by 


fg) = 7 f(a)g(«) de 


for any f,g€ X. 
Then it is straightforward to verify that this definition of inner product 
satisfies all our axioms. Thus we may define two functions to be orthogonal if 


(f,g) =0. 


We say that the angle 6 between two functions is given by 


(f,9) 
WF lillgl 


The length or norm of an element f € X is given by 


Itl= VON = ([ sar) a . 


cos 6 = 
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EXAMPLE 6.5.3 Let X be the space of all real sequences {a;}9°, with the 
property that )°5" , |aj|? < oo. This is a vector space with the obvious notions 
of addition and scalar multiplication: 


{aj} + {bj} = {aj + bj} 


and 
c{aj} = {ca;}. 


Define an inner product by 
({aj}, {bj}) = S— ajb;. 
j=l 


Then this inner product satisfies all our axioms. | 


For the purposes of studying Fourier series, the most important inner 
product space is that which we call L?[—7,7]. This is the space of real func- 
tions f on the interval [—7, 7] with the property that 


f(x)? dx <oo. 
The inner product on this space is 
(f.9)= | f(a)g(@) da. 


One must note here that, by a variant of the Cauchy—Schwarz—Buniakovski 
inequality, it holds that if f,g € L? then the integral [ f - gdz exists and is 
finite. So our inner product makes sense. 


a 


Exercises 


1. Verify that each pair of functions f, g is orthogonal on the given interval 
[a, b] using the inner product 


(f.9) = | f(@)g(@) de. 


a 


(a) f(x) =sin2z, g(x) =cos3z, [-7,7] 
(b) f(x) =sin2a, g(x) =sin4z, [0,7] 
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(c) f(z) =2?, g(x)=2°, [-1,]] 
(d) f(z) =a, g(x) =cos2x, [—2,2] 
2. Prove the so-called parallelogram law in the space L?: 
2\IfI? + 2ilgll? = lf + oll? + If - oll? 


[Hint: Expand the right-hand side.] 
3. Prove the Pythagorean theorem and its converse in L?: The function f is 
orthogonal to the function g if and only if 


If — al? = IFIP +1l9IP- 


4. In the space L”, show that if f is continuous and ||f|| = 0 then f(x) =0 
for all x. 

5. Prove the Cauchy-Schwarz—Buniakovski Inequality in L*: If f,g € L? 
then 


Kf, 91S IFll- Ilgll- 
Do this by considering the auxiliary function 
(A) = Ilf + Agll? 
and calculating the value of 4 for which it is a minimum. 


6. Bessel’s inequality states that, if f is any square-integrable function on 
[—1, 7] (i-e., f € L*), then its Fourier coefficients a; and b; satisfy 


co T 


s+ Gi +8) <= [ [f@Pae. 


j=l —" 
This inequality is fundamental in the theory of Fourier series. 
(a) For any n > 1, define 


—T 


1 . is 
Sn(x) = 520 + 245 cos jz + b; sin jx) 
= 
and show that 
: ” f(e)en (x) dx = of 24a 7b) 
T = he a 


(b) By considering all possible products in the multiplication of s,, (a) 
by itself, show that 


1 1 . 
> fF eacoe t= Soi + te + 2) 


(c) By writing 


ah. " |f(a) = 8n(2)/? de 
7 an \f(e)P ae — 2 . f(a) 8n(w) da + af |sn(x)|* de 


n 


1 [” 1 
© [| fle) av — S05 — (a +09), 


j=1 
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conclude that 


s+ a+) <= f [s@Pae. 
j=l =e 
(d) Now complete the proof of Bessel’s inequality. 

7. Use your symbol manipulation software, such as Maple or Mathematica, to 
implement the Gram—Schmidt procedure to orthonormalize a given finite 
family of functions on the unit interval. Apply your software routine to 
the family 1,2,a7,--- ,a'®. 


Historical Note 
Riemann 


Bernhard Riemann (1826-1866) was the son of a poor country minister in 
northern Germany. He studied the works of Euler and Legendre while still in 
high school; indeed, it is said that he mastered Legendre’s treatise on number 
theory in less than a week. Riemann was shy and modest, with little awareness 
of his own extraordinary powers; thus, at age nineteen, he went to the Uni- 
versity of Gottingen with the aim of pleasing his father by studying theology. 
Riemann soon tired of this curriculum, and with his father’s acquiescence he 
turned to mathematics. 

Of course the great Carl Friedrich Gauss was the senior mathematician in 
Gottingen at the time. Unfortunately, Gauss’s austere manner offered little for 
an apprentice mathematician like Riemann, so he soon moved to Berlin. There 
he fell in with Dirichlet and Jacobi, and he learned a great deal from both. 
Two years later he returned to Gottingen and earned his doctorate. During 
the next eight years Riemann suffered debilitating poverty and also produced 
his greatest scientific work. Unfortunately his health was broken. Even after 
Gauss’s death, when Dirichlet took the helm of the Gottingen math institute 
and did everything in his power to help and advance Riemann, the young 
man’s spirits and health were well in decline. At the age of 39 he died of 
tuberculosis in Italy, where he had traveled several times to escape the cold 
and wet of northern Germany. 

Riemann made profound contributions to the theory of complex variables. 
The Cauchy—Riemann equations, the Riemann mapping theorem, Riemann 
surfaces, the Riemann—Roch theorem, and the Riemann hypothesis all bear 
his name. Incidentally, these areas are all studied intensely today. 

Riemann’s theory of the integral, and his accompanying ideas on Fourier 
series, have made an indelible impression on calculus and real analysis. 

At one point in his career, Riemann was required to present a probationary 
lecture before the great Gauss. In this offering, Riemann developed a theory of 
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geometry that unified and far generalized all existing geometric theories. This 
is of course the theory of Riemannian manifolds, perhaps the most important 
idea in modern geometry. Certainly Riemannian curvature plays a major role 
in mathematical physics, in partial differential equations, and in many other 
parts of the subject. 

Riemann’s single paper on number theory (published in 1859), just ten 
pages, is about the prime number theorem. In it, he develops the so-called 
Riemann zeta function and formulates a number of statements about that 
deep and important artifact. All of these statements, save one, have by now 
been proved. The one exception is the celebrated Riemann hypothesis, now 
thought to be perhaps the most central, the most profound, and the most 
difficult problem in all of mathematics. The question concerns the location of 
the zeros of the zeta function, and it harbors profound implications for the 
distribution of primes and for number theory as a whole. 

In a fragmentary note found among his posthumous papers, Riemann 
wrote that he had a proof of the Riemann hypothesis, and that it followed 
from a formula for the Riemann zeta function which he had not simplified 
enough to publish. To this day, nobody has determined what that formula 
might be, and so Riemann has left a mathematical legacy that has baffled the 
greatest minds of our time. 


TS 


6.6 Introduction to the Fourier Transform 


Many problems of mathematics and mathematical physics are set on all of 
Euclidean space—not on an interval. Thus it is appropriate to have analytical 
tools designed for that setting. The Fourier transform is one of the most 
important of these devices. In this section we explore the basic ideas behind 
the Fourier transform. We shall present the concepts in Euclidean space of 
any dimension. Throughout, we shall use the standard notation f € L or 
f € L'(R") to mean that f is integrable. We define a norm on L! by 


1= t)| dt. 
Ilf\lz i lf ()| 
If t,€ € R” then we let 


t-€ = +---+twEn. 


We define the Fourier transform of a function f € L1(IR”) by 


fO= | f@e** a. 


R” 


Here dt denotes N-dimensional volume: dt = dt,dt2---dtn. We sometimes 
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write the Fourier transform as F(f) or simply Ff. It is also occasionally 
useful, as we shall see below, to write the Fourier transform as 


(f) . 


Many references will insert a factor of 27 in the exponential or in the 
measure. Others will insert a minus sign in the exponent. There is no agree- 
ment on this matter. We have opted for this particular definition because of 
its simplicity. 


Proposition If f € L'(IR‘), then 


n~ 


uP If(Q) < Wiley): 


Proof: Observe that, for any € € RY, 


Nn 


als f emes|ae= fe lae= Illes a 


In our development of the ideas concerning the Fourier transform, it is 
frequently useful to restrict attention to certain “testing functions.” We define 
them now. Let us say that f ¢ C* if f is k-times continuously differentiable 
and f is identically zero outside of some ball. Figure 6.14 exhibits such a func- 
tion. 
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FIGURE 6.14 
A CX function. 


Proposition If f € L'(R%), f is differentiable, and 
Of /Ox; € L'(RY), then 


(52) © =i 70. 


Proof: Integrate by parts: if f € C1, then 


af OF aug 
ae = a cit é Gp 
eS « | 5 : 
= [fC ses at;) dt, ...dtj-1dtj41...dtn 
Ot; 
= -[- [10 (ses) dt; dt, ...dtj_ydtj41 ...dty 
J 


Z ~it; ff pper® at 


= -~if;f(é). 
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(Of course the “boundary terms” in the integration by parts vanish since 
f € C!.) The general case follows from a limiting argument. oO 


Proposition If f € L'(R%) and iz;f € L'(R%), then 


Proof: Differentiate under the integral sign. O 


Proposition (The Riemann—Lebesgue Lemma) If 
f € L(RY), then 


lim |f(€)| =0. 


E06 


Proof: First assume that g € C?(R%). We know that 
IIgllz~ < IIgllza < C 


and, for each J, 


This proves the result for g € C?. 

Now let f € Lt be arbitrary. It is easy to see that there is a C? function 
w such that [| f — | dx < €/2. 

Choose M so large that when |é| > M then |:(€)| < €/2. Then, for 
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|| > M, we have 


IFO = IF-W O+ dO) 
< \(f-¥) Ol+ YO) 
< |lf-¥llat+s 
€ € 
ai 3 + 3 = €. 
This proves the result. oO 


REMARK 6.6.1 The Riemann—Lebesgue lemma is intuitively clear when 
viewed in the following way. Fix an L! function f. An LZ? function is well- 
approximated by a continuous function, so we may as well suppose that f 
is continuous. But a continuous function is well-approximated by a smooth 
function, so we may as well suppose that f is smooth. On a small interval 
I—say of length 1/M for M large—a smooth function is nearly constant. So, 
if we let |€| >> 27M7?, then the character e’S* will oscillate at least M times 
on J, and will therefore integrate against a constant to a value that is very 
nearly zero. As M becomes larger, this statement becomes more and more 
accurate. That is the Riemann—Lebesgue lemma. 


The three Euclidean groups that act naturally on R% are 


e rotations 
e dilations 
e translations 


Certainly a large part of the utility of the Fourier transform is that it has 
natural invariance properties under the actions of these three groups. We 
shall now explicitly describe those properties. 

We begin with the orthogonal group O(N); an N x N matrix is orthogonal 
if it has real entries and its rows form an orthonormal system of vectors. A 
rotation is an orthogonal matrix with determinant 1 (also called a special 
orthogonal matrix). 


Proposition Let p be a rotation of RN. We define 
pf (x) = f(p(x)). Then we have the formula 


pf = pf. 


Proof: Remembering that p is orthogonal and has determinant 1, we calculate 
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that 
pH = forneSat= f sotpyesat 
(s=e(t)) [ foeror as= [ flee" He 


n~ 


= — f(o€) = pf (€). 


1 


Here we have used the fact that p~! = 'p for an orthogonal matrix. The proof 


is complete. Oo 


Definition For 6 > 0 and f € L1(R%) we set asf(x) = f(dxr) and 
a f(x) = 5-N f(a/d). These are the dual dilation operators of Euclidean anal- 
ysis. 


Proposition The dilation operators interact with the 
Fourier transform as follows: 


(asf) f) 


Proof: We calculate that 


(asf) =f (asf) (eat 
= / f(st)e”’s dt 
(s=6t) [ foeeinte ds 


= 5 Ff(E/6) 
= (aA) (©. 


That proves the first assertion. The proof of the second is similar. oO 


For any function f on R“ and a € RY we define 7, f(x) = f(a — a). 
Clearly 7, is a translation operator. 
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Proposition If f ¢ L'(R) then 
Tad (6) = 8 FG) 


and 


({7}) © = le“'70)] 


Proof: For the first equality, we calculate that 


—_. 


mH) =f eras \(w) de 
= | e'®§ F(a — a) dx 
RN 
(z—a)=t | eli+s)< Fg) dt 
RN 
= es | e'§ f(t) dt 
RN 
= 8 F(E). 
The second identity is proved similarly. Oo 


Much of the theory of classical harmonic analysis—especially in this 
century—concentrates on translation-invariant operators. An operator T on 
functions is called translation-invariant? if 


T(taf)(@) = (taT f)(2) 


for every x. It is a basic fact that any translation-invariant integral T operator 
is given by convolution with a kernel k: 


Tha) =f FOWMea— that, 


See the next subsection for more on this topic. 


3It is perhaps more accurate to say that such an operator commutes with translations. 
However, the terminology “translation-invariant” is standard. 
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Proposition For f € L'(R™) we let f(x) = f(—2x). Then 


f=f. 


Proof: We calculate that 


a= | Fe Jets dt = ca tyes dt 


fie je dt = fl_e) = F(O. o 


Proposition We have 


Proof: We calculate that 


F=f FOe* at = [ ftoeresat the-it€ dt = fl—8) 


I 
Ss) 


AG) Oo 


Proposition If f,g € L', then 


i: Fl@g(© dé = i HOG dé 


Proof: This is a straightforward change in the order of integration: 


[Fes = ff ree" ato 
“J Jonas 


266 CHAPTER 6: FOURIER SERIES: BASIC CONCEPTS 


6.6.1 Convolution and Fourier Inversion 


If f and g are integrable functions then we define their convolution to be 


feg(a ya fr (x —t)g Jatt) at = f F(e\g(e —t) at 


Note that a simple change of variable confirms the second equality. 


Proposition If f,g € L', then 


Proof: We calculate that 


oo tet? dt = J [ te-99 s)dse'" dt 


= ff te— spe" at g(s)et** ds 
_ i F(t)e"§ dt | Ase ods 


= f(€)-G(E). o 


6.6.2. The Inverse Fourier Transform 


Our goal is to be able to recover f from f. This program entails several 
technical difficulties. First, we need to know that the Fourier transform is 
one-to-one in order to have any hope of success. Secondly, we would like to 
say that 


= c- f Flge dé. (6.6.1) 


But in general the Fourier transform f of an L function f is not integrable 
(just calculate the Fourier transform of xj{o,1;—the characteristic function of 
the unit interval)—so the expression on the right of (6.6.1) does not necessar- 
ily make any sense. 
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Theorem If f, fe L' (and both are continuous), then 


#(0) = (2n)-% i: Fe) ae. (6.6.2) 


Of course there is nothing special about the point 0 € R%. We now exploit 
the compatibility of the Fourier transform with translations to obtain a more 
general formula. We apply formula (6.6.2) in our theorem to t_;,f : The result 
is 


(rn f) (0) = (2m)-™ i (r_afy(@) a (6.6.3) 


Theorem (The Fourier Inversion Formula) If 
f,f € L' (and if both f,f are continuous), then for any 
y € R we have 


f(y) = (@m)-% / FQecvedé. (6.6.4) 


This result follows from (6.6.3)—just write out the integral. See [KRA3] 
for details. 


Plancherel’s Formula 


We now give a treatment of the quadratic Fourier theory. 
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Proposition (Plancherel) If f € C%°(R), then 


(2n)-N / Flop ag = i) f(a)? de. (6.6.5) 


Proof: Define g = f * F € C%(R). Then 


(6.6.6) 


ta) 
SY 
shy 
Sk) 
SR) 
SS 
SY 
7 
Hy 
Sh) 
+) 
= 
= 


Now 
g(0) = f *F (0) = / f(—t)F(-t) dt = i f(i)F(t) dt = i Uf (t)[? at. 


By Fourier inversion and formula (6.6.6) we may now conclude that 


| LF ()|? dt = (0) = (2m)-% i G(E) dé = (2n)-™ / FOP ae. 
That is the desired formula. J 


Definition — For any square integrable function f, the Fourier transform of 
f can be defined in the following fashion: Let f; € CO° satisfy f; — f in the 
L? topology. It follows from the Proposition that tFy is Cauchy in L?. Let g 
be the L? limit of this latter sequence. We set f= g. 


It is easy to check that this definition of fis independent of the choice of 
sequence f; € Co° and that 


(2m)-% i} flO ae = / f(@)P de. 


Problems for Review and Discovery 


A. Drill Exercises 


1. Find the Fourier series for each of these functions. 
(a) f(a)=2°, -r<a<n 
(b) g(x)=2-|a|, —n<a<n 
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(c) A(xz)=ax24+|z2|, -t7<aK<r 
(d) f(a)=|t|, -mSa<cn 
@)oe={ it Oeres 
© wo={ Th it “Wace<n 
2. Calculate the Fourier series for each of these functions. 
sere gee —nanss 
ma y= {GG Teeew ee 
wmeefor  -gsrs! 
(d) f(@) = { Su oe 
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3. Sketch the graphs of the first three partial sums of the Fourier series for 
each of the functions in Exercise 2. 


4. Calculate the sine series of each of these functions. 


(a) f(z) =cos2x, O<a<a 


(b) g(x) =2?, O<aK<n 
(c) h(x) = 2 — |x —1/2|/2, 
(d) f(x) =2?+|x+1/4I, 


O<a<a7 
O<a<a7 


5. Calculate the cosine series of each of these functions. 


(a) f(z) =sin3a, O<a<a 


(b) g(t)=27, OSa<n 
(c) A(x) =#— |x —1/2|/2, 
(d) f(z) =2?+|x+1/4), 


O<a<a7 
O<a<a7 


6. Find the Fourier series expansion for the given function on the given 


interval. 

(a) f(z) =2?-a2, -l<a<l 
(b) g(x) =sinz, -2<a<2 

(c) h(x) =cost, -3<a4<3 

(a) f(@)=|s|, -1<e<1 

(e) o(z)=|2-1/2], -2<2<2 
(f) h(x) =|e@+4+1/2|/2, -3<a<3 


B. Challenge Problems 


1. In Section 4.4 we learned about the Legendre polynomials. The first three 


Legendre polynomials are 
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Verify that Po, P,, Pz are mutually orthogonal on the interval [—1, 1]. Let 


aah 6) eo 
i= P Oxe<d, 


Find the first three coefficients in the expansion 
f(x) = aoPo(ax) + a1 Pi(x) + a2P2(x)+---. 


Repeat Exercise 1 for the function f(x) = « + |z|. 
The first three Hermite polynomials are 


Ho(x) =1, Hi(x) = 2a , Ho(x) = 427? — 2. 


Verify that these functions are mutually orthogonal on the interval 


x 


(—co, 00) with respect to the weight e~” (this means that 


/ H;(x)Hy(x)e"” dz =0 
if 7 4k). Calculate the first three coefficients in the expansion 


f(x) = bo Ho() + b1 Hi (x) + bo Ho(x) +--- 


for the function f(x) = x — |a}. 
The first three Chebyshev polynomials are 


To(x) =1, Ti(z) =a, T(x) = 2x? -1 


Verify that these functions are mutually orthogonal on the interval [—1, 1] 
with respect to the weight (1—x?)~!/? (refer to Exercise 3 for the meaning 
of this concept). Calculate the first three coefficients in the expansion 


f(x) = coTo(x) + 1 Ti (x) + coT2(x) +--- 


for the function f(x) = x. 


C. Problems for Discussion and Exploration 


Refer to Fejér’s Theorem 6.2.6 about convergence of the Cesaro means. 
Confirm this result by direct calculation for these functions. 


—1 if —7<a2<0 

(a) se) ={ | if O<a<n 
0 if —-7<a<0 

(b) ate) = { cos x if O<a<7 


sin x if —7<a<0 

(©) We) ={ 2 if O<a<a 

_f je+i1f/2 if -rt<a<0 
(a) se) ={ 0 if O<a<n 
A celebrated result of classical Fourier analysis states that, if f is continu- 
ously differentiable on [—7, 7], then its Fourier series converges absolutely. 
Confirm this assertion (at all points except the endpoints of the interval) 
in the following specific examples. 
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(a) f(a) =« 
(b) g(x) = 2° 
(c) h(x) = e* 


3. In other expositions, it is convenient to define Fourier series using the 
language of complex numbers. Specifically, instead of expanding in terms 
of cos jz and sin jx, we instead expand in terms of e’!*. Specifically, we 
work with a function f on [—7, 7] and set 


1 ay =e" 
a=, f@e dé; 


We define the formal Fourier expansion of f to be 
Sfw PS ce”. 
j=—oo 


Explain why this new formulation of Fourier series is equivalent to that 
presented in Section 6.1 (i.e., explain how to pass back and forth from 
one language to the other). 


What are the advantages and disadvantages of this new, complex form 
of the Fourier series? 


Taylor & Francis 
Taylor & Francis Group 


http://taylorandfrancis.com 
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Laplace Transforms 


e The idea of the Laplace transform 

e The Laplace transform and differential equations 
e Derivatives and the Laplace transform 

e Integrals and the Laplace transform 

e Convolutions 

e Step and impulse functions 


e Discontinuous input 


rr 


7.1 Introduction 


The idea of the Laplace transform has had a profound influence on the develop- 
ment of mathematical analysis. It also plays a significant role in mathematical 
applications. More generally, the overall theory of transforms has become an 
important part of modern mathematics. 

The concept of a transform is that it turns a given function into another 
function. We are already acquainted with several transforms: 


I. The derivative D takes a differentiable function f (defined on some 
interval (a, b)) and assigns to it a new function Df = f’. 


II. The integral J takes a continuous function f (defined on some in- 
terval [a, b] and assigns to it a new function 


rye) = fo steae. 


III. The multiplication operator M,, which multiplies any given func- 
tion f on the interval [a,b] by a fixed function y on [a,b], is a 
transform: 


Mof(a) = ox): f(a). 
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We are particularly interested in transforms that are linear. A transform 
T is linear if 


Tlaf + 8g] = aT(f) + BT(g) 


for any real constants a, 3. In particular (taking a = @ = 1), 


Tif +9) =T(f)+T(9) 


and (taking 6 = 0) 
Taf) = oT (f). 


We would like to understand linear transforms that are given by integra- 
tion. Let f be a function with domain [0,00). The Laplace transform of f is 
defined by 


Lfl(s) = Fle) = f° e** F(a) az for s >0. 


Notice that we begin with a function f of x, and the Laplace transform D 
produces a new function L[f] of s. We sometimes write the Laplace transform 
as F'(s). Notice that the Laplace transform is an improper integral; it exists 
precisely when 


N 


| e °* f(a) dx = vim e °" f(x) dx 


0 


exists and is finite. Because of the presence of the factor e~*”, the Laplace 
transform exists and is well defined for a large class of functions f. 
Let us now calculate some Laplace transforms: 


Laplace transform F 
=1 F(s) =f, ¢ =- 
ae € 


f (2) 


ee ede = 


—Ss8x 


s—a? 
sin ax dx = 


VSL 


cos ax dx = aaa 


e**sinhardr=zoz, $>a 


*coshardr =z, $>a 
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We shall not actually perform all these integrations. We content ourselves 

with the third one, just to illustrate the idea. The student should definitely 

perform the others, just to get the feel of Laplace transform calculations. 
Now 


L[z"|(s) = | ” ea” dn 


The reader will find, as we just have, that integration by parts is eminently 
useful in the calculation of Laplace transforms. 

It may be noted that the Laplace transform is a linear operator. Thus 
Laplace transforms of some compound functions may be readily calculated 
from the table just given: 


5-3! 2 
L(5a° — 2e|(s) = = 
(5a e*](s) a aa 
and 
F 4-2 
L(Asin 2a + 62](s) = aD + 2 


a 


Exercises 


1. Evaluate all the Laplace transform integrals for the table in this section. 


2. Without actually integrating, show that 
‘ a 
(a) L{sinh az] = G2 42: 
8 
b) L[cosh = 
(b) L{cosh az] a 
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Find L[sin? aa] and L|cos? ax] without integrating. How are these two 
transforms related to one another? 

Use the formulas given in the text to find the Laplace transform of each 
of the following functions. 


(a) 10 (d) 4sinxcosa +2e~* 


(b) «x? + cos 2x (e) 2° sin? 3a + 2° cos? 3x 
(c) 2e°” — sin5zx 


Find a function f whose Laplace transform is: 
30 1 
— d 
OF @ 
6 5 © az 
(c) 4 4 6 
©) 38" 5244 


[Hint: The method of partial fractions will prove useful.] 


Give a plausible definition of 3! (i-e., the factorial of the number 1/2). 


Use your symbol manipulation software, such as Maple or Mathematica, 
to calculate the Laplace transforms of each of these functions. 


(a) f(x) = sin(e*) 
x) = In(1 + sin? 2) 


(x) 
(c) h(x) = sin[Inz] 
(x) 


7.2 Applications to Differential Equations 


The key to our use of Laplace transform theory in the subject of differential 
equations is the way that L treats derivatives. Let us calculate 


Liy'\(s). = | * omy! (2) dx 


—sx 


= y(x)je 


+ | e **y(a) dx 
0 0 


= —y(0)+s-Llyl(s). 


In the second equality we of course used integration by parts. 


In summary, 


Ly'\(s) = s- L[y](s) — y(0). 
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Likewise, 


Ly'"\(s) 


lI 
tS 
— 
Nong 
II 
wH 
tS 
< 
—~ 
w 
wa 
| 
~ 


y’(0) 
s{s: Hes y(0)} — y/(0) 
s* - L[y|(s) — sy(0) — y/(0 


Now let us examine the differential equation 


y +ay’ + by = f(z), (721) 


with the initial conditions y(0) = yo and y’(0) = y. Here a and 6 are real 
constants. We apply the Laplace transform L to both sides of (7.2.1), of course 
using the linearity of L. The result is 


Lly"](s) + aL[y'](s) + bL[yl(s) = L{fl(s) - 


For brevity, we often omit explicit mention of the new independent variable 
s for the Laplace transform function. Writing out what each term is, we find 
that 


{s” - L[y] — sy(0) — y/(0)} + a{s - Lly] — y(0)} + bL[y] = Lf]. 


Now we can plug in what y(0) and y‘(0) are. We may also gather like terms 
together. The result is 


{ 3? tas+ b} Ly] =(sta)y+y+ Lif] 


(s+a)yot+ yi + Lf] 

Ly] = == =o ingeaky oe (7.2.2) 
What we see here is a remarkable thing: The Laplace transform changes 
solving a differential equation from a rather complicated calculus problem 
to a simple algebra problem. The only thing that remains, in order to find 
an explicit solution to the original differential equation (7.2.1) with initial 
conditions, is to find the inverse Laplace transform of the right-hand side of 
(7.2.2). In practice we shall find that we can often perform this operation in 

a straightforward fashion. The following examples will illustrate the idea. 


EXAMPLE 7.2.3 Use the Laplace transform to solve the differential equation 
y" + 4y = 4x (7.2.3.1) 
with initial conditions y(0) = 1 and y’(0) =5. 


Solution: 
We proceed mechanically, by applying the Laplace transform to both sides of 
(7.2.3.1). Thus 

Lly"| + L[4y] = L[4a) . 
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We can use our various Laplace transform formulas to write this out more 


explicitly: 
{s’Lly] — sy(0) — y’(0)} + 4L[y] =5 


or 
4 
s*Lly]— 8-1-5 + 4Lfy| = 3 
or 
2 4 
(s +4)Ly|=st+5+ 5. 
It is convenient to write this as 
fig= 8 4. i) 4: 4 _ ss i i) i 1 1 
W244 44 s2.(s2+4)  82+4 s2+4 52 9244’ 


where we have used a partial fractions decomposition in the last step. Simpli- 


fying, we have 
r= Ss if 4 i 1 
og +4  s2+4 82° 


Referring to our table of Laplace transforms, we may now deduce what y 
must be: 


Lly] = Licos 2a] + L[2 sin 2a] + Lia] = Li[cos 2% + 2sin 2x +4 a]. 


We deduce then that 


y =cos2x%+2sin2x74 a2, 


and this is the solution of our initial value problem. | 


REMARK 7.2.4 It is useful to note that our formulas for the Laplace transform 
of the first and second derivative incorporated automatically the values y(0) 
and y’(0). Thus our initial conditions got built in during the course of our 
solution process. 


A useful general property of the Laplace transform concerns its interaction 
with translations. Indeed, we have 


Lle* f(x)] = F(s—a). (7.2.5) 
To see this, we calculate 


Le f(a] = 


I 
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We frequently find it useful to use the notation L~! to denote the inverse 
operation to the Laplace transform.! For example, since 


2! 
L[z”] == 
]= 5, 
we may write 
2! 
L7t ae) eee 2 
Oa 
Since 1 1 
L 4% _ 2x _ a3 
poe s?+1 s—-2’ 
we may write 
1 1 
“(sh 5) =sine 2. 
8 s— 
EXAMPLE 7.2.6 Since b 
Lsin bz] = > TR 
we conclude that , 
Lle® sin ba] GoaP ae 


Since 


we thus have 


| Fe (5) = ere, a 


EXAMPLE 7.2.7 Use the Laplace transform to solve the differential equation 
y” + 2y’ + 5y = 3e~* sin x (7.2.7.1) 
with initial conditions y(0) = 0 and y’(0) = 3. 


Solution: 
We calculate the Laplace transform of both sides, using our new formula 
(7.2.5) on the right-hand side, to obtain 


1 


{s?L{u) ~ sy(0) — 9!(0)} +2 (sLfu) ~ w(0)} + SL] =3- Ty 


'We tacitly use here the fact that the Laplace transform L is one-to-one: if L[f] = L[g] 
then f = g. Thus L is invertible on its image. We are able to verify this assertion empir- 
ically through our calculations; the general result is proved rigorously in a more advanced 
treatment. 
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Plugging in the initial conditions, and organizing like terms, we find that 


3 
249 5)L[y| = 3 + ————_ 
ae Ween aed 
or 
3 3 
L ee ee eee 
WI s?4+2s+5 (5s? + 25+ 2)(s? +2545) 
7 3 1 1 
~  g24+29s4+5  s24+2s+2 82742845 


2 i 
CRERD ee ae CR PE 


Of course we have used partial fractions in the second equality. 
We see therefore that 


y=e “sin2x+e “sinz. 


This is the solution of our initial value problem. | 


SSS Ee so ss0__007 
Exercises 
1. Find the Laplace transforms of 
(a) ae?” (e) 8" cos 2x 
(b) (1—2?)e~* (f) xe 
(c) e *sing (g) x’ cosa 
(d) «sin3a (h) sinacosz 


2. Find the inverse Laplace transform of 


6 12 
(a) (e422 +9 (d) (s+3) 
Os © aT 
Ss 6 
©) yi © (s—13 


3. Solve each of the following differential equations with initial values using 
the Laplace transform. 


(a) y’+y=e", y(0)=0 
(b) y” —4y'+4y=0, y(0) =0 and y’(0) =3 
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(c) y+ 2y'+2y=2, y(0) =O and y/(0) = 1 
(d) y"+y'=30?, (0) =O and y/(0) =1 
(e) y”’ +2y’+5y=3e "sinz, y(0)=0 and y’(0) =3 

4. Find the solution of y” — 2ay’ + a?y = 0 in which the initial conditions 
y(0) = yo and y'(0) = yo are left unrestricted. (This provides an addi- 
tional derivation of our earlier solution for the case in which the auxiliary 
equation has a double root.) 


5. Apply the formula L[y’] = sL[y] — y(0) to establish the formula for the 
Laplace transform of an integral: 


o(f rear) = 72. 


Do so by finding 


in two different ways. 
6. Solve the equation 


yf tay +5 f ydzr =e” , y(0) = 0. 
0 


7.3 Derivatives and Integrals of Laplace Transforms 


In some contexts it is useful to calculate the derivative of the Laplace transform 


of a function (when the corresponding integral makes sense). For instance, 
consider 


Fs) =f e °* f(a) dx. 
Then 


d d - —sx 
asi) — al e °* f(a) dx 


2 [ < [e-** f(a] dex 


| * cf de 21-2 ON). 


We see that the derivative? of F(s) is the Laplace transform of —xf (a). More 
generally, the same calculation shows us that 


& : 
Gon (8) = Ele*F(2)]() 


2The passage of the derivative under the integral sign in this calculation requires ad- 
vanced ideas from real analysis which we cannot treat here—see [KRA2]. 
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and di 
aaFls) = L{(-1)/ x? f(x)|(s) . 


EXAMPLE 7.3.1 Calculate 


L[x sin az]. 
Solution: 
We have 
d d 2 
Dlx sin ax] = —L[—xsinaz] = — 3G, bisin ax] = ee = Pia? : 


EXAMPLE 7.3.2 Calculate the Laplace transform of 4/2. 


Solution: 
This calculation actually involves some tricky integrations. We first note that 


Lil Va] = L{x1/?] = -L[-2- a t/2] = _@ 


Ss 


Lia—'/?] , (7.3.2.1) 


Thus we must find the Laplace transform of «~!/?. 


Now es 
L{a2~1/?] = | eg? de. 
0 


The change of variables sz = t yields 
= ad ee ae: 
0 
The further change of variables t = p? gives the integral 


Lia~*/?| ny e?” dp. (7.3.2.2) 
0 


Now we must evaluate the integral J = neg eR dp. Observe, introducing 
the dummy variable u, that 


oe) F oe) co pr /2 3 
i=] e ? ap: | e “ au= f i. e " -rdédr. 
0 0 0 Jo 


Here we have introduced polar coordinates in the standard way. 
Now the last integral is easily evaluated and we find that 


put 
4 


7.8. DERIVATIVES AND INTEGRALS 283 
hence I = \/7/2. Thus L[x—1/?](s) = 2s—'/?{,/m/2} = \/z/s. Finally, 


d [x JT 
Lv] = ~ds\ is 253/2° = 


We now derive some additional formulas that will be useful in solving 
differential equations. We let y = f(x) be our function and Y = L/f] be its 
Laplace transform. Then 


Also 


and 


EXAMPLE 7.3.6 Use the Laplace transform to analyze Bessel’s equation 
ay’ +y' +ry=0 
with the single initial condition y(0) = 1. 


Solution: 
Apply the Laplace transform to both sides of the equation. Thus 


Lay") + Ly’) + Li[xy] = L[0] = 0. 


We can apply our new formulas (7.3.5) and (7.3.3) to the first and third terms 
on the left. And of course we apply the usual formula for the Laplace transform 
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of the derivative to the second term on the left. The result is 


[--('¥) +1] 4 [sY 11+ | ~ 


We may simplify this equation to 


dY 
(s? + 1) = —sY. 


This is a new differential equation, and we may solve it by separation of 


variables. Now 
dY _ sds 


Ys g2+1 


so 1 
mY =—Zn(s’+1)+C. 


Exponentiating both sides gives 


1 


Y=D-.—_——. 
s? +1 


It is useful (with a view to calculating the inverse Laplace transform) to 


write this solution as 
1/2 
Yoel (7.3.6.1) 
: 2 : 3.6. 


Recall the binomial expansion: 


- a(a —1) a(a — 1)(a — 2) 
(l+z)* = Loe eee ee 
SPeavlas 1 
in 4 ala ) _ n+ ) ma 


We apply this formula to the second term on the right of (7.3.6.1) with the 
role of z played by 1/s?. Thus 


D 1, hs Oe SD Be Tb AT 35 to 
y= 2-5-4545 


The good news is that we can now calculate L~! of Y (thus obtaining y) 
by just calculating the inverse Laplace transform of each term of this series. 
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The result is 


oo 1 j - 
yz) = D- ys Sanne a 


a at 8 
D-(1-Steeg- page t')- 


Since y(0) = 1 (the initial condition), we see that D = 1 and 


I 


2 at ina 


x 
Wl ata Beet 
It should be noted that the origin x9 = 0 is a regular singular point for 
this differential equation, and the Frobenius values of m are m = 0,0. That 
explains why we do not have two undetermined constants in our solution. 
The series we have just derived defines the celebrated and important 
Bessel function Jo. We have learned that the Laplace transform of Jo is 


1/Vs? +1. a 


It is also a matter of some interest to integrate the Laplace transform. 
We can anticipate how this will go by running the differentiation formulas in 
reverse. Our main result is 


re (2) = oe F(s)ds. (7.3.7) 


ia F(t) dt i“ ts é f(a) i) dt 
= i f(a) a e- dtde 


In fact 
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EXAMPLE 7.3.8 Use the fact that Llsinz] = 1/(s? + 1) to calculate 
Jo. (sin x)/2 de. 


Solution: 
By formula (7.3.7) (with f(z) = sin), 


Co 


| we de = Lisinx/2(0) = f 2 = arctan s| =<. a 


We conclude this section by summarizing the chief properties of the 
Laplace transform in a table. As usual, we let F'(s) denote the Laplace trans- 
form of f(x) and G(s) denote the Laplace transform of g(x). The last property 
listed in this table concerns convolution, and we shall treat that topic in the 
next section. 


Properties of the Laplace Transform 


Llaf(x) + Bg(a)] = aF(s) + BG(s) 
Le f(x)| = F(s — a) 
L{f'(«)| = sF(s) — f(0) 

Lf" (x)] = s°F(s) — sf(0) — f’(0) 


£ (22) =f rsyas 


Exercises 
1. Verify that 
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(rap): 


2. Calculate each of the following Laplace transforms. 
(a) Lx? sin az] 
(b) L[x*/?] 
(c) Lia cos az] 
(d) L{xe*] 
3. Solve each of the following differential equations. 
(a) zy” + (3x —1)y’ — (44+ 9)y =0 
(b) xy” + (22 + 3)y’ + (c+ 3)y = 3e-” 
4. Ifa and bare positive constants, then evaluate the following integrals. 


co ax .—bex 
(a) [ ae 
0 


x 


w) | e sin bx Ae 
0 x 


Use this result to find 


5. Without worrying about convergence issues, verify that 
(a) | Jo(x) dx =1 
0 
(b) Jo(x) = -/ cos(« cos t) dt 
T 


0 
6. Without worrying about convergence issues, and assuming x > 0, show 


that 
* sin xt 1 
(a) fay= [a= 
0 
~ cos xt To 
b = dt = =e * 
(b) fle) = f° EE at = Fe 
7. (a) If f is periodic with period a, so that f(a +a) = f(x), then show 
that 


F(s) me=/ e ' f(a)dz. 


~ [e748 

(b) Find F(s) if f(a) = 1 in the intervals [0,1], [2,3], [4,5], etc., and 
f =0 in the remaining intervals. 
8. If y satisfies the differential equation 
y” ae xy = 0, 
where y(0) = yo and y’(0) = y(, then show that its Laplace transform 
Y(s) satisfies the equation 
Y"4+s8°Y = syo + yo. 


Observe that the new equation is of the same type as the original equa- 
tion, so that no real progress has been made. The method of Example 
7.2.6 is effective only when the coefficients are first-degree polynomials. 
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7.4  Convolutions 


An interesting question, that occurs frequently with the use of the Laplace 
transform, is this: Let f and g be functions and F' and G their Laplace trans- 
forms; what is L~'[F' - G]? To discover the answer, we write 


(| e* F(t) it) (| e *" f(u) au) 


[ ye e+") F(t) g(u) dtdu 


= | é cnet) 9(0 at) glu) du, 


Now we perform the change of variable x = t+ u in the inner integral. 


The result is 
| (/ e °* f(a —u) ax) g(u) du 
0 t 


ie e °* f(a —u)g(u) dadu. 
0 Jt 


I 


F(s)- G(s) 


I 


I 


F(s)- G(s) 


l 


Reversing the order of integration, we may finally write 


F@)-€@): = i 7 ( i ere) au) de 


= ie es eo Fe Seta) au) ae 


ae! [He = wate au] 


We call the expression {> f(a — u)g(u) du the convolution of f and g. 
Many texts write 


Our calculation shows that 


Lif * g\(s) =F. G=L/f]- L[g]. 


The convolution formula is particularly useful in calculating inverse 
Laplace transforms. 
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EXAMPLE 7.4.2 Calculate 


1 1 1 
Lot ae ee = Stop ose : 
(a) L € ssi) 


/ (a —t)-sintdt. 
0 


I 


Notice that we have recognized that 1/s? is the Laplace transform of x and 
1/(s? +1) is the Laplace transform of sin x, and then applied the convolution 
result. 
Now the last integral is easily evaluated (just integrate by parts) and seen 
to equal 
r—sing. 


We have thus discovered, rather painlessly, that 
je } =2-—sinz | 
s2(s2+1)) 


The reader may note that this last example could also be done by using 
partial fractions. 

An entire area of mathematics is devoted to the study of integral equations 
of the form 


fla) = va) + f° ke —oy(e) at. (7.4.3) 


Here f is a given forcing function, and k is a given function known as the ker- 
nel. Usually k is a mathematical model for the physical process being studied. 
The object is to solve for y. As you can see, the integral equation involves a 
convolution. And, not surprisingly, the Laplace transform comes to our aid in 
unraveling the equation. 

In fact we apply the Laplace transform to both sides of (7.4.3). The result 
is 

Lf] = Lly] + LIK] - Ly] 

hence 


Lf] 
Ly] = ——... 
w= om 
Let us look at an example in which this paradigm occurs. 


EXAMPLE 7.4.4 Use the Laplace transform to solve the integral equation 


rn) = x? Tease: ‘ 
y(2) + [sing t)y(t) at 
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Solution: We apply the Laplace transform to both sides (using the convo- 
lution formula): 
L[y] = L[x?] + L[sin 2] - L[y]. 


Solving for Ly], we see that 


Lx} 3!/s4 


Mel = 7 Tein] ~ Ta 1/PFD) 


We may simplify the right-hand side to obtain 
3! 3! 


Of course it is easy to determine the inverse Laplace transform of the right- 


hand side. The result is 


g, 2 
=e +=. a 
y(“) =a 50 


7.4.1 Abel’s Mechanics Problem 


We now study an old problem from mechanics that goes back to Niels Henrik 
Abel (1802-1829). Imagine a wire bent into a smooth curve (Figure 7.1). The 
curve terminates at the origin. Imagine a bead sliding from the top of the 
wire, without friction, down to the origin. The only force acting on the bead 
is gravity, depending only on the weight of the bead. Say that the wire is the 
graph of a function y = y(a). Then the total time for the descent of the bead 
is some number T(y) that depends on the shape of the wire and on the initial 
height y. Abel’s problem is to run the process in reverse: Suppose that we are 
given a function T. Then find the shape y of a wire that will result in this 
time-of-descent function T. 

What is interesting about this problem, from the point of view of the 
present section, is that its mathematical formulation leads to an integral equa- 
tion of the sort that we have just been discussing. And we shall be able to 
solve it using the Laplace transform. 

We begin our analysis with the principle of conservation of energy. Namely, 


Ln (SB) = m-g--9) 
aM (a) =m-g-(y—v). 


In this equation, m is the mass of the bead, ds/dt is its velocity (where of 
course s denotes arc length), and g is the acceleration due to gravity. We 
assume that s’(0) = 0. 

We use (u,v) as the coordinates of any intermediate point on the curve. 
The expression on the left-hand side is the standard one from physics for 
kinetic energy. And the expression on the right is the potential energy. 
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FIGURE 7.1 
A smooth curve which a bead will slide down. 


We may rewrite the last equation as 


where the minus sign occurs because the bead is falling. This is equivalent to 
ds 
Vv 2g(y — v) 


Integrating from v = y to v = 0 yields 


dt = — 


“We de (7.4.5) 


tw) [on [eal SS 


Now we know from calculus how to calculate the length of a curve: 


sana [om 


f(y) = s'(y) =4f/1+ (*). (7.4.6) 


hence 


Substituting this last expression into (7.4.5), we find that 
(y) 1 i ¥ f(v) dv 
y= 

Vv2g Jo VY 


This formula, in principle, allows us to calculate the total descent time T(y) 
whenever the curve y is given. From the point of view of Abel’s problem, 


(7.4.7) 
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the function T(y) is given, and we wish to find y. We think of f(y) as the 
unknown. Equation (7.4.7) is called Abel’s integral equation. 

We note that the integral on the right-hand side of Abel’s equation is a 
convolution (of the functions y~!/? and f). Thus, when we apply the Laplace 
transform to (7.4.7), we obtain 


L{T(y)] = i LU). 


Now we know from Example 7.2.2 that L[y~!/?] = \/m/s. Hence the last 
equation may be written as 


LIf(y)] = V29- —_ = ee gS LPG) (7.4.8) 


When T(y) is given, then the right-hand side of (7.4.8) is completely known, 
so we can then determine L[f(y)] and hence y (by solving the differential 
equation (7.4.6)). 


EXAMPLE 7.4.9 Analyze the case of Abel’s mechanical problem when T(y) = 
To, a constant. 


Solution: Our hypothesis means that the time of descent is independent of 
where on the curve we release the bead. A curve with this property (if in fact 
one exists) is called a tautochrone. In this case equation (7.4.8) becomes 


Lfc)] = yf 2s" *zin = yf 72 ove 7 = ove / 


where we have used the shorthand b = 2g73/n?. Now L~"[,/x/s] = y~'/?, 
hence we find that 


(=a (7.4.9.1) 


Now the differential equation (7.4.6) tells us that 
da\"? _ b 
14 (4) =2 
dy y 


] y 


Using the change of variable y = bsin? ¢, we obtain 


2b | cos? oad 


hence 


x 
= b [ (1 + 00826) do 


= (26 + sin 26) ee 
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FIGURE 7.2 
The cycloid. 


In conclusion, 
b b 
r= 5(2o+sin2¢)+C and ~— y= 5(1 — cos2¢). (7.4.9.2) 


The curve must, by the initial mandate, pass through the origin. Hence C = 0. 
If we put a = 6/2 and @ = 2¢ then (7.4.9.2) takes the simpler form 


x = a(6 + sin @) and y = a(1—cos@). 


These are the parametric equations of a cycloid (Figure 7.2). A cycloid is a 
curve generated by a fixed point on the edge of a disc of radius a rolling along 
the x-axis. See Figure 7.3. We invite the reader to work from this synthetic 
definition to the parametric equations that we just enunciated. | 


Thus the tautochrone turns out to be a cycloid. This problem and its 
solution is one of the great triumphs of modern mechanics. An additional 
very interesting property of this curve is that it is the brachistochrone. That 
means that, given two points A and B in space, the curve connecting them 
down which a bead will slide the fastest is the cycloid (Figure 7.4). This last 
assertion was proved by Isaac Newton, who read the problem as posed in a 
public challenge by Bernoulli in a periodical. Newton had just come home from 
a long day at the British Mint (where he worked after he gave up his scientific 
work). He solved the problem in a few hours, and submitted his solution 
anonymously. But Bernoulli said he knew it was Newton; he “recognized the 
lion by his paw.” 
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FIGURE 7.3 
Generating the cycloid. 


FIGURE 7.4 
The brachistrochrone. 
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Exercises 


1. Find L~'[1/(s? + a?)] by using convolution. [Hint: Refer to Exercise 1 
of the last section.] 


2. Solve each of the following integral equations. 


(a) v(e)=1- f “(e—t)y(t) at 


0 


(b) y(ax) =e" (1 +f e ‘y(t) ar) 


0 


(c) e “=y(x)4+2 n cos(x — t)y(t) dt 


) 
(d) 3sin 2x = y(a) +f (a — t)y(t) dt 


3. Find the equation of the curve of descent if T(y) = k,/y for some constant 
k. 


4. Show that the initial value problem 


y t+ay= f(x), (0) =y'(0) =0, 
has solution 


aad : sin a(x — , 
ula) =~ f° W)simale at 


a 


7.5 The Unit Step and Impulse Functions 


In this section our goal is to apply the formula 


Lif * 9] = Lif) - Lgl 


to study the response of an electrical or mechanical system. 

Any physical system that responds to a stimulus can be thought of as a 
device (or black box) that transforms an input function (the stimulus) into 
an output function (the response). If we assume that all initial conditions are 
zero at the moment t = 0 when the input f begins to act, then we may 
hope to solve the resulting differential equation by application of the Laplace 
transform. 

To be more specific, let us consider solutions of the equation 


y+ ay’ + by = f 
satisfying the initial conditions y(0) = 0 and y’(0) = 0. Notice that, since the 


equation is nonhomogeneous, these zero initial conditions cannot force the 
solution to be identically zero. The input f can be thought of as an impressed 
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external force F' or electromotive force E' that begins to act at time t = 0—just 
as we discussed when we considered forced vibrations. 
When the input function happens to be the unit step function (or heaviside 
function) 
0 if t<0O 
{ 1 if ¢t20, 


then the solution y(t) is denoted by A(t) and is called the step response (or 
indicial response). That is to say, 


A" + aA’ +bA=u. (7.5.1) 


Now, applying the Laplace transform to both sides of (7.5.1), and using 
our standard formulas for the Laplace transforms of derivatives, we find that 


1 
s’L[A] + asL[A] + bL[A] = L[u] = —. 
8 
Here we have calculated that L[u](s) = 1/s. 
So we may solve for L[A] and obtain that 
1 1 ee 
L[|A] = - - =———- = - -— (7.5.2) 


s s+ast+b 8 2z(s) 
where 
z(s)=s*?+as+b. (7.5.3) 


Note that we have just been examining the special case of our differential 
equation with a step function on the right-hand side. Now let us consider the 
equation in its general form (with an arbitrary external force function /f): 


y’ +ay'+by=f. 


Applying the Laplace transform to both sides (and using our zero initial con- 
ditions) gives 
s*L[y] + asLly] + bL[y] = Lf] 


Lly| - 2(s) = Lf] 
Lif 
Lly| = a. (7.5.4) 
We divide both sides of (7.5.4) by s and use (7.5.2). The result is 
5 Ell = ey EN] = 2A): ELA. 


This suggests the use of the convolution theorem: 


~- E{y) = L[Axf]. 
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As a result, 


Lily] = o-1(f ates ar) 


= L($ [ Ae—nyeer) 


Thus we finally obtain that 


y(t) = if A(t —7)f(r) dr. (7.5.5) 


What we see here is that, once we find the solution A of the differential 
equation with a step function as an input, then we can obtain the solution 
for any other input f by convolving A with f and then taking the derivative. 
With some effort, we can rewrite equation (7.5.5) in an even more appealing 
way. 

In fact we can go ahead and perform the differentiation in (7.5.5) to obtain 


y(t) = | Al(t—1)f(r) dr + A(0)f (2). 


Alternatively, we can use a change of variable to write the convolution as 


| f(t-—o)A(o)do. 
0 


This results in the formula 


y(t) = | fi(t— 0) Alo) do + f(0)A(). 


Changing variables back again, this gives 


w= | A(t — 7) f'(r) dr + f(O)A(C). (7.5.6) 


We notice that the initial conditions force A(0) = 0 so our other formula 
(7.5.6) becomes 


y(t) = ii A'(t— 7) f(r) dr. (7.5.7) 


Either of (7.5.6) or (7.5.7) is commonly called the principle of superposition. 
They allow us to represent a solution of our differential equation for a general 
input function in terms of a solution for a step function. 


EXAMPLE 7.5.8 Use the principle of superposition to solve the equation 
y+ y’ ae 6y oe et 
with initial conditions y(0) = 0, y’(0) = 0. 
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Solution: We first observe that 

2(s)=s?+s—-6 
(see the discussion of equation (7.5.3)). Hence 


1 


me s(s?+5—6) | 


Now it is a simple matter to apply partial fractions and elementary Laplace 
transform inversion to obtain 
1 1 1 
A #t) = 2 ae —e3t 4 — et. 
(¢) 6 15 10 


Now f(t) = 2e**, f’(t) = 6e, and f(0) = 2. Thus (7.5.6) gives 


y(t) 


lI 
oo 
oS 

a 
+ 
an 
&l 
a 
se 
t 
a 
+ 
a 
fa 
yes 
, 
& 
SS 
a 
® 
w 
=) 
Q 
4 


1 1 1 
9{— __p—3t pat 
“hs ( 6 + n° + 10° ) 


We invite the reader to confirm that this is indeed a solution to our initial 
value problem. B 


We can use the second principle of superposition, rather than the first, to 
solve the differential equation. The process is expedited if we first rewrite the 
equation in terms of an impulse (rather than a step) function. 

What is an impulse function? Physicists think of an impulse function as 
one that takes the value 0 at all points except the origin; at the origin the 
impulse function takes the value +oo. See Figure 7.5. 

In practice, mathematicians think of an impulse function as a limit of 
functions 
if O<a<e 
pe(x) = 


o alr 


if a«>e 


as ¢ > 0+. See Figure 7.6. Observe that, for any « > 0, {5° ¢-(x) dx = 1. It is 
straightforward to calculate that 


1 —_ e %€ 


SE 


Lpe] = 
and hence (using l’Hépital’s Rule) that 


lim Llye|(s) =1 for all s. 
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FIGURE 7.5 


An impulse function. 


FIGURE 7.6 
An impulse function as a limit of tall, thin bumps. 
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Thus we think of the impulse—intuitively—as an infinitely tall spike at the 
origin with Laplace transform identically equal to 1. The mathematical justi- 
fication for the concept of the impulse was outlined in the previous paragraph. 
A truly rigorous treatment of the impulse requires the theory of distributions 
(or generalized functions) and we cannot cover it here. We do give a brief 
introduction to distributions in the next chapter. 

It is common to denote the impulse function by 6(¢) (in honor of Paul 
Dirac, who developed the idea).* We have that 


In the special case that the input function for our differential equation 
is f(t) = 6, then the solution y is called the impulsive response and denoted 
h(t). In this circumstance we have 


Lh] = ro 
hence : 
Ati= 07-7 (= ; 
Now we know that : ; r 
L{A] = - He) = a 


As a result, 


But this last formula shows that A’(t) = h(t), so that our second super- 
position formula (7.5.7) becomes 


y(t) = | h(t —7) f(r) dr. (7.5.8) 


In summary, the solution of our differential equation with general input func- 
tion f is given by the convolution of the impulsive response function with 


f. 
EXAMPLE 7.5.9 Solve the differential equation 
y" 4 y’ -_ 6y = 2e3t 


with initial conditions y(0) = 0 and y’(0) = 0 using the second of our super- 
position formulas, as rewritten in (7.5.8). 


3It should be noted that, strictly speaking, the Dirac impulse function is not a function. 
But it is nonetheless useful, from an intuitive point of view, to treat it as a function. Modern 
mathematical formalism provides rigorous means to handle this object. 
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Solution: We know that 


nt) = 17 ss) 


As a result, 


y(t) 


II 
o— 
ou ke 
a 
fay) 
iS) 
aN 
| 
4 
YS 
.j 
w 
- 
| 
a 
NY 
bo 
fay) 
w 
+ 
Q 
4 


Of course this is the same solution that we obtained in the last example, using 
the other superposition formula. | 


To form a more general view of the meaning of convolution, consider a 
linear physical system in which the effect at the present moment of a small 
stimulus g(r) dr at any past time T is proportional to the size of the stimulus. 
We further assume that the proportionality factor depends only on the elapsed 
time t — 7, and thus has the form f(t— 7). The effect at the present time t is 
therefore 

f(t—7)-g(r) dr. 


Since the system is linear, the total effect at the present time t due to the 
stimulus acting throughout the entire past history of the system is obtained 
by adding these separate effects, and this observation leads to the convolution 
integral 


/ f(t—T)g(r) dr. 
0 


The lower limit is 0 just because we assume that the stimulus started acting 
at time t = 0, ie., that g(r) = 0 for all 7 < 0. Convolution plays a vital role in 
the study of wave motion, heat conduction, diffusion, and many other areas 
of mathematical physics. 
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Exercises 


1. 


Find the convolution of each of the following pairs of functions: 


(a) 1, sinat 
(b) ec, e” foraf#b 
(c) t, e® 


sinat, sinbt fora 

d) si in bt fe b 
Verify that the Laplace transform of a convolution is the product of the 
Laplace transforms for each of the pairs of functions in Exercise 1. 

Use the methods of Examples 8.4.8, 8.4.9 to solve each of the following 
differential equations. 

(a) y" + 5y' + 6y = 5e™, ie 
(b) y”+y'—6y=t, y(0 
(c) y’-y =, y(O)= 
When the polynomial z(s) 


1 1 A B 
a ~ 


for suitable constants A and B, then 
h(t) = Ae + Be”. 


Also equation (7.5.8) takes the form 


t 
t)= / f(r) [Ae + Bee“) dr. 
0) 


This formula is sometimes called the Heaviside expansion theorem. 


(a) Use this theorem to write the solution of y’ + 3y’ + 2y = f(b), 
y(0) = y'(0) =0. 

(b) Give an explicit evaluation of the solution in (a) for the cases f(t) = 
e* and f(t) =t 

(c) Find the solutions in (b) by using the superposition principle. 

Show that f * g = g * f directly from the definition of convolution, by 

introducing a new dummy variable 0 = t — rT. This calculation shows 

that the operation of convolution is commutative. It is also associative 

and distributive: 


«|g hl =([f*g]*h 
and 
felgthl=fregtfrh 
and 
[f+glxh=fxhtgxh. 


Use a calculation to verify each of these last three properties. 
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6. We know from our earlier studies that the forced vibrations of an un- 
damped spring-mass system are described by the differential equation 


Ma" +kx = f(t), 


where x(t) is the displacement and f(t) is the impressed external force 
or “forcing function.” If (0) = x’(0) = 0, then find the functions A and 
h and write down the solution x(t) for any f(¢). 

7. The current J in an electric field with inductance L and resistance R is 
given (as we saw in Section 1.12) by 


I 
L— I=E. 

at +R 
Here FE is the impressed electromotive force. If [(0) = 0, then use the 
methods of this section to find J in each of the following cases. 
(a) E(t) = Eou(t) 
(b) E(t) = Eod(t) 
(c) E(t) = Eosinwt 


(I 
Historical Note 
Laplace 


Pierre Simon de Laplace (1749-1827) was a French mathematician and theo- 
retical astronomer who was so celebrated in his own time that he was some- 
times called “the Isaac Newton of France.” His main scientific interests were 
celestial mechanics and the theory of probability. 

Laplace’s monumental treatise Mécanique Céleste (published in five vol- 
umes from 1799 to 1825) contained a number of triumphs, including a rigorous 
proof that our solar system is a stable dynamical system that will not (as New- 
ton feared) degenerate into chaos. Laplace was not always true to standard 
scholarly dicta; he frequently failed to cite the contributions of his predeces- 
sors, leaving the reader to infer that all the ideas were due to Laplace. 

Many anecdotes are associated with Laplace’s work in these five tomes. 
One of the most famous concerns an occasion when Napoleon Bonaparte en- 
deavored to get a rise out of Laplace by protesting that he had written a huge 
book on the system of the world without once making reference to its author 
(God). Laplace is reputed to have replied, “Sire, I had no need of that hypoth- 
esis.” Lagrange is reputed to have then said that, “It is a beautiful hypothesis 
just the same. It explains so many things.” 

One of the most important features of Laplace’s Mécanique Céleste is its 
development of potential theory. Even though he borrowed some of the ideas 
without attribution from Lagrange, he contributed many of his own. To this 
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day, the fundamental equation of potential theory is called “Laplace’s equa- 
tion,” and the partial differential operator involved is called “the Laplacian.” 

Laplace’s other great treatise was Théorie Analytique des Probabilités 
(1812). This is a great masterpiece of probability theory, and establishes many 
analytic techniques for studying this new subject. It is technically quite sophis- 
ticated, and uses such tools as the Laplace transform and generating functions. 

Laplace was politically very clever, and always managed to align himself 
with the party in power. As a result, he was constantly promoted to ever more 
grandiose positions. To balance his other faults, Laplace was quite generous 
in supporting and encouraging younger scientists. From time to time he went 
to the aid of Gay-Lussac (the chemist), Humboldt (the traveler and natu- 
ralist), Poisson (the physicist and mathematician), and Cauchy (the complex 
analyst). Laplace’s overall impact on modern mathematics has been immense, 
and his name occurs frequently in the literature. 


a 


7.6 Flow Initiated by an Impulsively Started Flat Plate 


Imagine the two-dimensional flow of a semi-infinite extent of viscous fluid, 
supported on a flat plate, caused by the motion of the flat plate in its own 
plane. Let us use cartesian coordinates with the x-axis lying in the plane of 
the plate and the y-axis pointing into the fluid. See Figure 7.7. 

Now let u(a,y,t) denote the velocity of the flow in the x-direction only. 
It can be shown that this physical system is modeled by the boundary value 
problem 


u=0 if t=0,y>0 
u=U if t>0,y=0 
u>-0 if t>0,yroou. 


Here v is a physical constant known as the kinematic viscosity. The constant 
U is determined by the initial state of the system. This partial differential 
equation is a version of the classical heat equation. It is parabolic in form. 
It can also be used to model other diffusive systems, such as a semi-infinite 
bar of metal, insulated along its sides, suddenly heated up at one end. The 
system we are considering is known as Rayleigh’s problem. This mathematical 
model shows that the only process involved in the flow is the diffusion of x- 
momentum into the bulk of the fluid (since u represents unidirectional flow in 
the x-direction). 

In order to study this problem, we shall freeze the y-variable and take 
the Laplace transform in the time variable t. We denote this “partial Laplace 
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FIGURE 7.7 
Flow of a viscous fluid on a flat plate. 


transform” by L. Thus we write 


Liu(y,t)] = U(y,s) = ie e u(y, t) dt. 


We differentiate both sides of this equation twice with respect to y—of course 
these differentiations commute with the Laplace transform in t. The result is 


7 (2) av 
Oy2) Ay?” 


But now we use our partial differential equation to rewrite this as 


But of course we have a formula for the Laplace transform (in the t-variable) 
of the derivative in ¢ of u. Using the first boundary condition, that formula 
simplifies to L[Ou/Ot] = sU(y, s). Thus the equation becomes 
OF 33 
— -—-U=0. 7.6.1 
Oy? V ( ) 


In order to study equation (7.6.1), we think of s as a parameter and of y 
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as the independent variable. So we now have a familiar second-order ordinary 
differential equation with constant coefficients. The solution is thus 

U(y, s) = A(s)eV9¥/VY + B(s)e~V5u/v 


Notice that the “constants” depend on the parameter s. Also u — 0 as y — oo. 
Passing the limit under the integral sign, we then see that U = u — 0 as 
y — oo. It follows that A(s) = 0. We may also use the second boundary 
condition to write 


co co Uu 
U(0,s) = i: u(0,t)e"* dt = i Ue“ dt = —. 
0 0 2 
As a result, B(s) =U/s. We thus know that 
U(y, s) = u . e V8U/VY | 
8 


A difficult calculation (see [KBO, pp. 164—167]) now shows that the inverse 
Laplace transform of U is the important complementary erf function erfc. Here 


we define 2 fe 
erfc(x) = = | et dt. 
Vi Jo 


(This function, as you may know, is modeled on the Gaussian distribution 
from probability theory.) It can be calculated that 


u(y, t) =U -erfe (45) 


We conclude this discussion by noting that the analysis applies, with some 
minor changes, to the situation when the velocity of the plate is a function of 
time. The only change is that the second boundary condition becomes 


u=Uf(t) if t>0,y=0. (7.6.2) 


Note the introduction of the function f(t) to represent the dependence on 
time. The Laplace transform of equation (7.6.2) is U(0,s) =UF(s), where F 
is the Laplace transform of f. Now it follows, just as before, that B(s) = F(s). 
Therefore 
—Vv8y/Vv 
U(y, 8) =UF(s)eV°¥/V” = UsF(s) - a, 
8 


For simplicity, let us assume that f(0) = 0. 
Of course L[f’(t)] = sF'(s) and so 


U(y,s) =U- Lif’ (t)] -L {exfe (5) \ 


Finally, we may use the convolution theorem to invert the Laplace transform 


and obtain 
u(y,t) =U tf. f(t —7) - erfe (=) ar} 
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a 


Problems for Review and Discovery 


A. Drill Exercises 


1. Calculate the Laplace transforms of each of the following functions. 
(a) f(t) =8+ 4e* — 5cos3t 
1 if O<t<4 
(b) g(t)=¢ 0 if 4<t<8 
elif 8<t 
2 


-t if O<t<2 
(c) m={ 4 if 2<t<oo 
0 if 0<t<4 
(d) se) ={ 3t-12 if 4<t<o 
(e) g(t) =t? — 8 + cos V2t 
(f) h(t) =e7~* cos4t +t? — et 
(g) f(t) =e~* sin V5t + t?e~* 
(h) g(t) = te’ — t?e7' + te 
(i) A(t) =sin?t 
(j) f(t) = sin 3t sin 5t 
(k) g(t) =(1-e7*? 
(1) h(t) = cosh 4t 
(m) f(t) = cos 2t sin 3t 


2. Find a function f whose Laplace transform is equal to the given expres- 
sion. 


4 


@) 2ri6 
s—2 
(0) roas 46 
4 
°) Gray 
3s —2 
rage 
2s—5 
(¢) (s+ 1l(s+3)\(s—4) 
sg? +2542 
() Go3p6r) 
(g) 3s? + 4s 
8) (@—s+2)(e—1) 
6s? — 13s +2 
(h) sine ame 
F S+ 
(i) s?+s+6 
(i) a 


(2 —A(s +2) 
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3. Use the method of Laplace transforms to solve the following initial value 


problems: 


(a) y(t) + 3ty'(t) — Sy) = 1, yO) = y/(0) = 1 
(b) y(t) + 3y’(t) — 2y(t) = —6e™™",  y(m) = 1 

(c) y(t) + 2y'(t) — y(t) = te, (0) =, y/(0) = 1 

(d) yy +y=3e™, y(0) =3,y(0) =2 

Use the method of Laplace transforms to find the general solution of 
each of the following differential equations. [Hint: Use the boundary 
conditions y(0) = A and y’(0) = B to introduce the two undetermined 
constants that you need.] 


(a) y” —5y'+4y =0 

(b) y+ 3y' + 3y = 2 

(c) y"+y'+2y=t 

(d) y” — Ty’ + 12y = te” 

Express each of these functions using one or more step functions, and 
then calculate the Laplace transform. 


0 if 0<t<2 

3 if 2<t<5 

ay LOS 4 4 if 5<t<8 
-4 if 8<t<oo 

0 if 0O<t<3 

cb) a={o, if 3<t<0o 
t if 0<t<3 

(c) A(t)=2 1 if 3<t<6 
1-t if 6<t<o@ 


B. Challenge Problems 


1. 


Solve each equation for L~'(F). 


(a) s?F(s) — 9F(s) = — 


Ta | 
(b) pF (s) + 3F(s) = G>S*3 
(c) pF(s) - F(s) = = 
(d) s*F(s) + F(s) = = 


Use the formula for the Laplace transform of a derivative to calculate the 
inverse Laplace transforms of these functions. 


(a) F(s) =In (#8) 
(b) F(s) =In (=3) 
(c) F(s) = In (344) 
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3. The current I(t) in a circuit involving resistance, conductance, and ca- 
pacitance is described by the initial value problem 


ad? dl 

—— +2— +31 =4g(t 

qe ral a(t) 
dl 

10) =8 , [(0)=0, 


where 
30 if 0<t<27 


g(t) = 0 if 2n<t<5r 
10 if 51<t<o. 


Find the current as a function of time. 


In Exercises 4—7, determine the Laplace transform of the function which is 
described by the given graph. 


4, a) 5 ay 


Petes es hee 14 


a da 3a 4a x a 2a 3a 4a x 
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C. Problems for Discussion and Exploration 
1. Define, for j a positive integer, 


0 if oe Gas 
bj(t) = 4 25 i “3 See5 
0 if $<2<oo. 


Calculate the Laplace transform of ¢;, and verify that it converges to the 
Laplace transform of a unit impulse function. 

2. Derive this formula of Oliver Heaviside. Suppose that P and Q are poly- 
nomials with the degree of P less than the degree of g. Assume that 
T1,---,Tn are the distinct real roots of Q, and that these are all the roots 
of Q. Show that 


(5) 0= = rey 


3. Let us consider a linear system controlled by the ordinary differential 
equation 
ay" (t) + by'(t) + cy(t) = g(t). 
Here a, b,c are real constants. We call g the input function for the system 
and y the output function. 
Let Y = L{y] and G = L[g]. We set 


Y(s) . 
Gls) (*) 


H(s) = 


Then H is called the transfer function for the system. Show that the 
transfer function depends on the choice of a,b,c but not on the input 
function g. In case the input function g is the unit step function u(t), 
then equation (*) tells us that 


In these circumstances we call the solution function the indicial admit- 
tance and denote it by A(t) (instead of the customary y(t)). 


We can express the general response function y(t) for an arbitrary input 
g(t) in terms of the special response function A(t) for the step function 
input u(t). To see this assertion, first show that 


Ly](s) = sL[A](s)L[g|(s) . 


Next apply the fact that the Laplace transform of a convolution is the 
product of the Laplace transforms to see that 


y(t) = < (fat vaw av) = < (f awae— av) 
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Actually carry out these differentiations and make the change of variable 
¢ =t—v to obtain Duhamel’s formulas 


< 
= 
oo 
S 
T 
as 
= 
= 
m™ 
SS 
a 
= 
wm 
as 
Q 
mm 
~ 
aN 
pa 
oo 
S 
Ss 
ss 
=) 
Ss 
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Distributions 


Schwartz functions 


Schwartz distributions 


e Cutoff functions 


Differentiation of distributions 


Fourier transform of distributions 


e Other spaces of distributions 


Structure theorem for distributions 


( 


8.1 Schwartz Distributions 


Thorough treatments of distribution theory may be found in [HOR], [KRA4]. 
Here we give a quick review. 
We define the space of Schwartz functions: 


a (2) o@ 


= (01 ---5N)sB = (Bis---oBH) 


S = {oe crn") : Pa,a(d) = sup 


xcERN 


Here 
SD = Oy ak 
Ox ra) fe F) it mA) Bn 
and 
ges £7) Bo? ae 


Observe that e~!!” € S and p(x) - e—l#l” € § for any polynomial p. Any 
derivative of a Schwartz function is still a Schwartz function. The Schwartz 
space is obviously a linear space. 
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It is worth noting that the space of C' functions with compact support 
(which we have been denoting by Co°) forms a proper subspace of S. Since 
as recently as 1930 there was some doubt as to whether C’S° functions are 
genuine functions, it may be worth seeing how to construct elements of this 
space. 

Let the dimension N equal 1. Define 


2. etal? if «>0 
Ne)={ 0 if «<0 


Then one checks, using l’H6pital’s Rule, that 4 € C™(R). Set 


h(x) = (—2 — 1) -A(a@ +1) € C®(R). 


Moreover, if we define 


then the function 
f(x) = g(x + 2) - g(-2 — 2) 


lies in C'S° and is identically equal to a constant on (—1,1). Thus we have 
constructed a standard “cutoff function” on R!. On RN, the function 


F(a) = f(a1)-+- fen) 
plays a similar role. 
Exercise: [The C© Urysohn lemma] Let K and L be disjoint compact 


sets in R%. Prove that there is a C© function ¢ on R% such that ¢ = 0 on 
K and ¢ = 10n L. (Details of this sort of construction may be found in [HIR].) 


8.1.1 The Topology of the Space S 


The functions pg,g are seminorms on S. A neighborhood basis of 0 for the 
corresponding topology on S is given by the sets 


Ne,e,m — {@: S- Pa,3(@) < ehs 


lal|<é 
[Bl <m 


Exercise: The space S cannot be normed. 


Definition 8.1.1 A Schwartz distribution a is a continuous linear functional 
on S. We write a € S’. 


Examples: 
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1. If f € L' then f induces a Schwartz distribution as follows: 


sso fofdrec. 


We see that this functional is continuous by noticing that 


| [oe )F(@) de] < sup Ifllas = C- pool), 


A similar argument shows that any finite Borel measure induces a 
distribution. 


2. Differentiation is a distribution: On R!, for example, we have 
S3 or 4(0) 


satisfies 


|¢'(0)| < sup |¢’(x)| = po,1(¢). 
«cER 
3. If fe L?,1<p<o, then f induces a distribution: 
T;:S30- | 6fdrec. 
To see that this functional is bounded, we first notice that 
[fot] < Whee ol (8.1.2) 


where 1/p+ 1/p’ = 1. Now notice that 


(1 + |2|%*")|(z)| < C(p0,0(¢) + px41,0(¢)) 


hence C 
|o(x)| < TH ayat (P0.04) + pn+io(¢)). 
Finally, 
eee 
low <¢-| f (oa) ts [p0.0(8) + pxr+io(8)]- 


As a result, (8.1.2) tells us that 


T;(¢) < Cll fll ze (p0,0(¢) + px+1,0(¢)).- 
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8.1.2 Algebraic Properties of Distributions 


(i) If a, 8 € S’ then a + @ is defined by (a + B)(¢) = a(¢) + B(¢d). Clearly 
a+ ( so defined is a Schwartz distribution. 

(ii) If a € S’ and c € C then ca is defined by (ca)(¢) = c[a(¢)]. We see 
that ca € S’. 

(iii) If » € S and a € S’ then define (a)(¢) = a(w¢). It follows that 
wa is a distribution. 

(iv) It is a theorem of Laurent Schwartz (see [SCH]) that there is no 
continuous operation of multiplication on S’. However it is a matter of great 
interest, especially to mathematical physicists, to have such an operation. 
Colombeau [CMB] has developed a substitute operation. We shall say no more 
about it here. 

(v) Schwartz distributions may be differentiated as follows: If  € S’ then 
(0/dx)° un € S' is defined, for ¢ € S, by 


a\? a\? 
aol = (—1)!4| ies ; 
(4) | (6) = (1)! (2) ‘) 
Observe that in case the distribution ju is induced by integration against a C* 


function f, then the definition is compatible with what integration by parts 
would yield. 


Let us differentiate the distribution induced by integration against the 
function f(x) = || on R. Now, for ¢€ S, 


f@ = 


—f(¢) 
= -[ séav 
2 -f- Fladayae— f f(x)o" (a) dex 
7 Jace rd! (x) dx 
—[ao(a)]§ + [ole de + [v0(e of ole 
i sy 


Thus f’ consists of integration against b(x) = —x(~.0,0] + X{0,00), Where 
fl if «wes 
XS) 0 if a«¢S 


is the characteristic function of the set S. This function is a version of the 


I 


I 
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heaviside function that we saw in Section 7.4. 


Exercise: Let Q C R% be a smoothly bounded domain. Let v be the unit 
outward normal vector field to OQ. Prove that —vxq € S’. [Hint: Use Green’s 
theorem. It will turn out that (—vya)(¢) = Jog odo, where do is area measure 
on the boundary.] 


8.1.3. The Fourier Transform 


The principal importance of the Schwartz distributions as opposed to other 
distribution theories (more on those below) is that they are well-behaved under 
the Fourier transform. First we need a lemma: 


Lemma 8.1.3 If f © S then fe S. 


Proof: Recall (see Section 7.6) that the Fourier transform converts multipli- 
cation by monomials into differentiation and vice versa. oO 


Definition 8.1.4 If u is a Schwartz distribution then we define a Schwartz 
distribution u by 


By the lemma, the definition of U makes good sense. Moreover, by 8.1.5 
below, 


[@(9)| = |u@)1< So pa,ald) 


la|+|B|<M 


for some M > 0 (by the definition of the topology on S). It is a straightforward 
exercise with 8.1.3 and 8.1.4 to see that the sum on the right is majorized by 


the sum 
C- So pa,a(9). 
ljal|+|B|<M 


In conclusion, the Fourier transform of a Schwartz distribution is also a 
Schwartz distribution. 


8.1.4 Other Spaces of Distributions 


Let D= CS and € = C®™. Clearly D C S C E. On each of the spaces D and 
E we use the semi-norms 


0 a 
NK,a() = sup |(2) (| , 
K xc 
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where K C RY is a compact set and a = (a1,...,@y) is a multi-index. These 
induce a topology on D and € which turn them into topological vector spaces. 
The spaces D’ and €’ are defined to be the continuous linear functionals on 
D and € respectively. Trivially, €’ C S’ C D’. The functional in R! given by 


b= 57246), 
j=l 


where 6; is the Dirac mass centered at the integer j, is readily seen to be in 
D’ but not in &’. 

The support of a distribution yp is defined to be the complement of the 
union of all open sets U such that (¢) = 0 for all elements ¢ of Co° that are 
supported in U. As an example, the support of the Dirac mass 6p is the origin: 
when dg is applied to any testing function ¢ with support disjoint from 0 then 
the result is 0. 


Exercise: Let yp € D’. Then pw € €’ if and only if ~ has compact support. 
The elements of €’ are sometimes referred to as the “compactly supported 
distributions.” 


Proposition 8.1.5 A linear functional L on S is a 
Schwartz distribution (tempered distribution) if and only 
if there is a C' > 0 and integers m and € such that for all 
~eS we have 


IL(P)| <C> YS" S° pa,8(4)- (8.1.5.1) 


la|<é|B|<m 


Sketch of Proof: If an inequality like (8.1.5.1) holds then clearly L is con- 
tinuous. 

For the converse, assume that L is continuous. Recall that a neighborhood 
basis of 0 in S is given by sets of the form 


Neem ={PES: S Pa,B tie 


la|<é 
[Bl <m 


Since L is continuous, the inverse image of an open set under L is open. 


Consider 
u({2 EC: |z|< i}). 


There exist €, 2,m such that 


Nein Ch (2 EC: |z|< i}). 
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Thus 
ye Pa,3(¢) <e€ 
lal<e 
[a|<m 
implies that 
|L(¢)| <1. 
That is the required result, with C = 1/e. oO 


Exercise: A similar result holds for D’ and for &’. 


Theorem 8.1.6 (Structure Theorem for D’) If u€ D’ 
then 


k 


Ue > Di ay 


j=1 


where 1; is a finite Borel measure and each D! is a differ- 
ential monomial. 


Idea of Proof: For simplicity restrict attention to R'. We know that the 
dual of the continuous functions with compact support is the space of finite 
Borel measures. In a natural fashion, the space of C! functions with compact 
support can be identified with a subspace of the set of ordered pairs of C, 
functions: f < (f, f’). Then every functional on C} extends, by the Hahn- 
Banach theorem, to a functional on C, x C,. But such a functional will be 
given by a pair of measures. Combining this information with the definition 
of derivative of a distribution gives that an element of the dual of C} is of the 
form {11 + (2)’. In a similar fashion, one can prove that an element of the 
dual of C* must have the form py + (J)! +--+ + (gyi). 

Finally, it is necessary to note that D’ is nothing other than the countable 
union over k of the dual spaces (C*)’. Oo 


The theorem makes explicit the fact that an element of D’ can depend on 
only finitely many derivatives of the testing function, that is on only finitely 
many of the 7K... 

We have already noted that the Schwartz distributions are the most conve- 
nient for Fourier transform theory. But the space D’ is often more convenient 
in the theory of partial differential equations (because of the control on the 
support of testing functions). It will sometimes be necessary to pass back and 
forth between the two theories. In any given context, no confusion should re- 
sult. 


Exercise: Use the Paley-Wiener theorem or some other technique to prove 
that if 6 € D then ¢ ¢ D. (This fact is often referred to as the Heisenberg 
uncertainty principle. In fact it has a number of qualitative and quantitative 
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formulations that are useful in quantum mechanics. See [FEF] for more on 
these matters.) 


8.1.5 More on the Topology of D and D’ 
We say that a sequence {¢;} C D converges to ¢ € D if 


1. All the functions ¢; have compact support in a single compact set 
Ko. 

2. k,o(¢; —¢) — 0 for each compact set K and for every multi-index 
Q. 


The enemy here is the example of the “gliding hump:” On R!, if w is a 
fixed C™ function and ¢;(x) = (a — j) then we do not want to say that the 
sequence {¢,;} converges to 0. 

A functional 4 on D is continuous if p(¢;) — u(¢) whenever ¢; — ¢. This 
is equivalent to the already noted characterization that there exist a compact 
kK and an N > 0 such that 


mMO<C SD neald) 


lal<N 


for every testing function ¢. 


Exercises 


1. Show that any derivative of the Dirac mass has support consisting 
of just the origin. 


Calculate the Fourier transform of the Dirac mass. 
Calculate the Fourier transform of the derivative of the Dirac mass. 
Which distribution has derivative equal to the heaviside function? 


In R°, what is the Laplacian (in the sense of distributions) of |a|~'? 


Se: sa Ee 


What is the distribution derivative of the characteristic function of 
the unit interval in R? 


7. Let f be an integrable function on R. Why is its derivative equal to 
a distribution? 


8. The function f(z) = x? is not integrable on R. But it is still a 
Schwartz distribution. Explain why. 
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a 


Problems for Review and Discovery 


A. Drill Exercises 


Set ee Ne 


Calculate the distribution derivative of the function f(a) = 271. 


What is the third derivative of the Dirac delta mass? 

Calculate all second derivatives of the Dirac delta mass in R?. 
Calculate the second distribution derivative of f(x) = || in R?. 
Calculate the distribution derivative of log |x|. 

Give three distinct examples of distributions that are not functions. 
Give an example of a function whose distribution derivative is Lebesgue 
measure dz. 

Let 7; be the Diract mass at the point 1/7 € R. Does {7;} have a limit 
in the distribution topology? If so, what is it? 


B. Challenge Problems 


7. 


8. 


Give an example of an element of D’ that is not an element of S’. 

Show that if f is continuously differentiable, and is also a distribution, 
then its calculus derivative and its distribution derivative are equal. 
Show that if f is a continuous function, and is also a distribution, then 
its classical Fourier transform and its distribution Fourier transform are 
equal. 

In the section of this book on the Fourier transform, we showed how the 
classical Fourier transform interacts with translations and dilations and 
rotations. Prove analogous results for the distribution Fourier transform. 
In the section of this book on the Fourier transform, we showed how the 
classical Fourier transform interacts with the derivative. Prove analogous 
results for the distribution Fourier transform. 

Formulate and prove a version of the fundamental theorem of calculus 
for distributions. 

Show that if two distributions on R have the same derivative then they 
differ by a constant. 


Show that the distribution derivative of a measure cannot be a function. 


C. Problems for Discussion and Exploration 


1. 


Let n > 3. Show that the fundamental solution for the Laplacian in 
R” is P(x) = c- |x|~"*? for a suitable constant c. This means that 
AI (a) = 6(x), where 6 is the Dirac function. 

Refer to Exercise 1. Show that the fundamental solution for the Laplacian 
in dimension 2 is clog |z|. 

Inequality (8.1.5.1) suggests a topology on the space of Schwartz func- 
tions. This in turn induces a topology on the space of Schwartz distribu- 
tions. Describe these topologies explicitly. 
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Refer to Exercise 3. Show that the functions 


3 (x) = J X[o,1/s) (2) 


converge to the Dirac delta mass in the given distribution topology. 
What is the closure of Cz? in the Schwartz space topology? 
Is the Schwartz space complete in the indicated topology? 
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Wavelets 


e Localization in the space and frequency variables 
e Building a custom Fourier analysis 

e The Haar basis 

e A wavelet basis 

e The wavelet transform 

e Decomposition and reconstruction 

e Applications 


e Cumulative energy and entropy 


(I 
9.1 Localization in the Space and Frequency Variables 


The premise of the new versions of Fourier analysis that are being developed 
today is that sines and cosines are not an optimal model for some of the phe- 
nomena that we want to study. As an example, suppose that we are developing 
software to detect certain erratic heartbeats by analysis of an electrocardio- 
gram. (Note that the discussion that we present here is philosophically correct 
but is over-simplified to facilitate the exposition.) The scheme is to have the 
software break down the patient’s electrocardiogram into component waves. 
If a wave that is known to be a telltale signal of heart disease is detected, then 
the software notifies the user. 

A good plan, and there is indeed software of this nature in use across 
America. But let us imagine that a typical electrocardiogram looks like that 
shown in Figure 9.1. Imagine further that the aberrant heartbeat that we wish 
to detect is the one in Figure 9.2. 

What we want the software to do is to break up the wave in Figure 9.1 into 
fundamental components, and then to see whether one of those components is 
the wave in Figure 9.2. Of what utility is Fourier theory in such an analysis? 
Fourier theory would allow us to break the wave in Figure 9.1 into sines and 
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FIGURE 9.1 
A heartbeat shown in an electrocardiogram. 


FIGURE 9.2 
An aberrant heartbeat. 


cosines, then break the wave in Figure 9.2 into sines and cosines, and then 
attempt to match up coefficients. Such a scheme may be dreadfully inefficient, 
because sines and cosines have nothing to do with the waves we are endeavoring 
to analyze. 

The Fourier analysis of sines and cosines arose historically because sines 
and cosines are eigenfunctions for the wave equation (see Chapter 10). Their 
place in mathematics became even more firmly secured because they are or- 
thonormal in L?. They also commute with translations in natural and useful 
ways. The standard trigonometric relations between the sine and cosine func- 
tions give rise to elegant and useful formulas—such as the formulas for the 
Dirichlet kernel and the Fejér kernel and the Poisson kernel. Sines and cosines 
have played an inevitable and fundamental historical role in the development 
of harmonic analysis. 

In the same vein, translation-invariant operators have played an impor- 
tant role in our understanding of how to analyze partial differential equations 
(see [KRA3]), and as a step toward the development of the more natural the- 
ory of pseudodifferential operators. Today we find ourselves studying transla- 
tion non-invariant operators—such as those that arise in the analysis on the 
boundary of a (smoothly bounded) domain in RY (see Figure 9.3). The T(1) 
theorem of David-Journé gives the most natural and comprehensive method 
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FIGURE 9.3 
A smoothly bounded domain. 


of analyzing integral operators, and their boundedness on a great variety of 
spaces. 

The next, and current, step in the development of Fourier analysis is to 
replace the classical sine and cosine building blocks with more flexible units— 
indeed, with units that can be tailored to the situation at hand. Such units 
should, ideally, be localizable; in this way they can more readily be tailored 
to any particular application. This, roughly speaking, is what wavelet theory 
is all about. 

In a book of this nature, we clearly cannot develop the full assemblage of 
tools that are a part of modern wavelet theory. [See [HERG], [MEY1], [MEY2], 
[DAU] for more extensive treatments of this beautiful and dynamic subject. 
The papers [STR] and [WAL] provide nice introductions as well.] What we 
can do is to give the reader a taste. Specifically, we shall develop a Multi- 
Resolution Analysis, or MRA; this study will show how Fourier analysis may 
be carried out with localization in either the space variable or the Fourier 
transform (frequency) variable. In short, the reader will see how either vari- 
able may be localized. Contrast this notion with the classical construction, 
in which the units are sines and cosines—clearly functions which do not have 
compact support. The exposition here derives from that in [HERG], [STR], 
and [WAL]. We also thank G. B. Folland and J. Walker for considerable guid- 
ance in preparing this chapter. 


TT 


Exercises 


1. Let 
_J N if 1<ax<1+i/N 
Onne if «<1 or r>141/N 
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for N a large positive integer. Calculate the Fourier coefficients of f using 
ideas from Section 7.1. Calculate the first five partial sums, and notice 
that each of these partial sums has a tail that extends across the entire 
interval [0, 27]. 


2. Perform the steps of Exercise 1 with 


alt @ LZ ees 
~ ) 0 if a&<1 or x>2. 


3. Let « > 0. Perform the steps of Exercise 1 with 


faye 1 if e<a<2r7-e 
T=) 9 if aw<e or x>27-e. 
How do the Fourier coefficients behave as « — 0? 


4. Perform the steps of Exercise 1 with 
f= sina if 1<a<141/N 
~ | 0 if «<1 or x>1+1/N 


for N a large positive integer. Notice that this Fourier series has a tail 
on the entire interval [0, 27]. 


SS i 
9.2 Building a Custom Fourier Analysis 

Typical applications of classical Fourier analysis are 

Frequency Modulation: Alternating current, radio transmission 


Mathematics: Ordinary and partial differential equations, analysis of linear 
and nonlinear operators 


Medicine: Electrocardiography, magnetic resonance imaging, biological neu- 
ral systems 


Optics and Fiber-Optic Communications: Lens design, crystallogra- 
phy, image processing 


Radio, Television, Music Recording: Signal compression, signal repro- 
duction, filtering 


Spectral Analysis: Identification of compounds in geology, chemistry, bio- 
chemistry, mass spectroscopy 


Telecommunications: Transmission and compression of signals, filtering of 
signals, frequency encoding 
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In fact, the applications of Fourier analysis are so pervasive that they are part 
of the very fabric of modern technological life. 

The applications that are being developed for wavelet analysis are very 
similar to those just listed. But the wavelet algorithms give rise to faster and 
more accurate image compression, faster and more accurate signal compression 
and analysis, and better denoising techniques that preserve the original signal 
more completely. The applications in mathematics lead, in many situations, 
to better and more rapid convergence results. 

What is lacking in classical Fourier analysis can be readily seen by ex- 
amining the Dirac delta mass. Because, if the unit ball of L'—thought of as 
a subspace of the dual space of C(T)—had any extremal functions (it does 
not), they would be objects of this sort: the weak-x limit of functions of the 
form N~*yx{-1/2N,1/2N] as N — +oo. That weak-x limit is the Dirac mass. We 
know the Dirac mass as the functional that assigns to each smooth function 
with compact support its value at 0: 


6:C-(RX) 3 6+ (0). 


The point comes through most clearly by way of Fourier series. Consider 
the Dirac mass 6 supported at the origin in the circle group T. Then the 
Fourier-Stieltjes coefficients of 6 are 


7 


4(j) = mal eJ* d5(t) = 1. 


TT 


Thus recovering 6 from its Fourier series amounts to finding a way to sum the 


formal series 
Co 
y 1- et 
j=—00 


in order to obtain the Dirac mass. Since each exponential is supported on 
the entire circle group, the imagination is defied to understand how these 
exponentials could sum to a point mass. (To be fair, the physicists have no 
trouble seeing this point: at the origin the terms all add up, and away from 
zero they all cancel out.) 

The study of the point mass is not merely an affectation. In a radio signal, 
noise (in the form of spikes) is frequently a sum of point masses (Figure 9.4). 
On a phonograph record, the pops and clicks that come from imperfections in 
the surface of the record exhibit themselves (on an oscilloscope, for instance) 
as spikes, or point masses. 

For the sake of contrast, in the next section we shall generate an ad hoc 
family of wavelet-like basis elements for L? and show how these may be used 
much more efficiently to decompose the Dirac mass into basis elements. 
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FIGURE 9.4 
Noise in a radio signal. 


(ree 


Exercises 
1. Define / 
N if 0<a<1/N 
fu(e)={ 0 if 2x<0 or 1/N<z. 
Show that fx — 6 (the Dirac mass) in the sense of distributions. This 
means that, if y € Co°, then 


/ fre(a) (a) dx > (0) = / (a) d6(c). 


2. Refer to Exercise 1. Let y € CS, » > 0, fydx = 1. Prove that 
Ngy(Na) — 6 as N — +00 in the sense of distributions. 

3. Refer to Exercise 2. Calculate the Fourier coefficients of Ny(Na). How 
do these coefficients behave as N — +00? 


4. Refer to Exercise 2. Calculate the Fourier coefficients of ey(ex) for € > 0 
small. How do these coefficients behave as « > 0*? 


a 


9.3. The Haar Basis 


In this section we shall describe the Haar wavelet basis. While the basis 
elements are not smooth functions (as wavelet basis elements usually are), 
they will exhibit the other important features of a Multi-Resolution Analysis 
(MRA). In fact we shall follow the axiomatic treatment as developed by Mal- 
lat and exposited in [WAL] in order to isolate the essential properties of an 
MRA. 
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1 


y= DC) 


FIGURE 9.5 
The functions y and w. 


We shall produce a dyadic version of the wavelet theory. Certainly other 
theories, based on other dilation paradigms, may be produced. But the dyadic 
theory is the most standard, and quickly gives the flavor of the construction. In 
this discussion we shall use, as we did in Chapter 6, the notation as to denote 
the dilate of a function: as f(x) = f(dxr).1 And we shall use the notation Tq to 
denote the translate of a function: ta f(x) = f(x — a). 

We work on the real line R. Our universe of functions will be L?(R), the 
square-integrable functions. Define 


~(£) = X0,1(2) = { ; : a ¢ [0,1) 


and 


W(x) = p(2x) — yp(2x—-1) = X{0,1/2) (x) = X{1/2,1) (2). 
These functions are exhibited in Figure 9.5. 

The function y will be called a scaling function and the function ~ will 
be called the associated wavelet. The basic idea is this: translates of y will 
generate a space Vo that can be used to analyze a function f on a large scale— 
more precisely, on the scale of size 1 (because 1 is the length of the support 
of y). But the elements of the space Vo cannot be used to detect information 
that is at a scale smaller than 1. So we will scale the elements of Vo down 
by a factor of 27, each 7 = 1,2,..., to obtain a space that can be used for 


1We use the notation 6 in other parts of the book to denote the Dirac delta mass. You 
should be able to tell from context which meaning of 6 is intended. 
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analysis at the scale 2~/ (and we will also scale Vo up to obtain elements that 
are useful at an arbitrarily large scale). Let us complete this program now for 
the specific y that we have defined above, and then present some axioms that 
will describe how this process can be performed in a fairly general setting. 
Now we use ¥ to generate a scale of function spaces {V;}jez. We set 


Vo = { So axlrngl : Slax? < oo, 


keZ 


for the particular function y that was specified above. Of course each element 
of Vo so specified lies in L? (because the functions 7, have disjoint supports). 
But it would be wrong to think that Vo is all of EE. for an element of Vo is 
constant on each interval [k,k +1), and has possible jump discontinuities 
only at the integers. The functions {7,y},ez form an orthonormal basis (with 
respect to the L? inner product) for Vo. 

Now let us say that a function g is in Vj if and only if ay,2g lies in Vo. 
Thus g € V, means that g is constant on the intervals determined by the 
lattice (1/2)Z = {n/2:n € Z} and has possible jump discontinuities only at 
the elements of (1/2)Z. It is easy to see that the functions { V2a27; f : f € Vo} 
form an orthonormal basis for Vj. 

Observe that Vo C Vi since every jump point for elements of Vo is also a 
jump point for elements of Vj (but not conversely). More explicitly, we may 
write 

Tr = A272 f + A2Tar+if; 


thus expressing an element of Vo as a linear combination of elements of V1. 

Now that we have the idea down, we may iterate it to define the spaces 
V; for any 7 € Z. Namely, for 7 € Z, V; will be generated by the functions 
Q2iTmy, all m € Z. In fact we may see explicitly that an element of V; will be 
a function of the form 


f= ye eX (e/25,[¢+1]/23) 
te 


where > |ae|? < oo. Thus an orthonormal basis for Vj is given by 
{25/2095 Tmp}meZ- 

Now the spaces V; have no common intersection except the zero function. 
This is so because, since a function f € NjezV; would be constant on arbitrar- 
ily large intervals (of length 2~/ for j negative), then it can only be in L? if 
it is zero. Also UjezV; is dense in L? because any L? function can be approx- 
imated by a simple function (i.e., a finite linear combination of characteristic 
functions), and any characteristic function can be approximated by a sum of 
characteristic functions of dyadic intervals. 

We therefore might suspect that if we combine all the orthonormal bases 
for all the V;,7 € Z, then this would give an orthonormal basis for L?. That 
supposition is, however, incorrect. For the basis elements y € Vo and agi Toy € 
V; are not orthogonal. This is where the function 7 comes in. 
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Since Vo C V; we may proceed by trying to complete the orthonormal 
basis {7} of Vo to an orthonormal basis for V,. Put in other words, we write 
Vi = Vo ® Wo, and we endeavor to write a basis for Wo. Let w = agp—a2T1y~ 
be as above, and consider the set of functions {7,7)} for m € Z. Then this is 
an orthonormal set. Let us see that it spans Wo. 

Let fh be an arbitrary element of Wo. So certainly h € V;. It follows that 


h = S- bj;Q2T;— 
J 


for some constants {b;} that are square-summable. Of course h is constant on 
the interval [0, 1/2) and also constant on the interval [1/2,1). We note that 


ott) = 5 [elt +¥()] on [0,1/2) 


and i 
v(t) = 5 le(t)—¥(t)] on [1/2, 1). 


nie) = (PE) oy + (AE) we 


on [0,1). Of course a similar decomposition obtains on every interval [j, 7 +1). 


It follows that 


As a result, 
h= S- CjTIP + S- djT5W, 
jEZ jEZ 
where 
Cj = ot ie and dj = iat aee 


Note that h € Wo implies that h € Vol. Also every 7;¢p is orthogonal to every 
Trew. Consequently every coefficient c; = 0. Thus we have proved that h is in 
the closed span of the terms 7;7. In other words, the functions {7;W} jez span 
Wo. 
Thus we have V; = Vo @ Wy, and we have an explicit orthonormal basis 
for Wo. Of course we may scale this construction up and down to obtain 


for every j. And we have the explicit orthonormal basis {29/?09; Tm}mez for 
each W;. 
We may iterate the equation (9.3.1); to obtain 


Vier = Vi OW; = Vj-1 6 Wj-1 © W; 
= -+:=VYeW OW, G:-:-BWj-1 BS W;j. 
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Letting 7 — +00 yields 
P=VWeQw;. (9.3.2) 
j=0 
But a similar decomposition may be performed on Vo, with W; in descending 


order: 
VY =V.10W.1=:::=V~OW27@:::- OW. 


Letting @ — +oo, and substituting the result into (9.3.2), now yields that 
LD? = QW. 
jEZ 


Thus we have decomposed L?(R) as an orthonormal sum of Haar wavelet 
subspaces. We formulate one of our main conclusions as a theorem: 


Theorem 9.3.1 The collection 


H= {2!?2aa;tm :m,j € 2} 


is an orthonormal basis for L?, and will be called a wavelet 
basis for L?. 


Now it is time to axiomatize the construction that we have just performed 
in a special instance. 


Axioms for a Multi-Resolution Analysis (MRA) 


A collection of subspaces {V;}jez of L?(R) is called a Multi-Resolution Anal- 
ysis or MRA if 

MRA, (Scaling) For each j, the function f € V; if and only if a2 f € Vj41. 
MRAg2 (Inclusion) For each j, Vj C Vj+1. 


MRAs3 (Density) The union of the V;’s is dense in L?: 


closure U Vi? = D7 (R). 
jeZ 


MRA, (Maximality) The spaces V; have no non-trivial common inter- 


section: 
() Vj = {0}. 
jeZ 
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MRA; (Basis) There is a function y such that {7;~} ez is an orthonormal 
basis for Vo. 


We invite the reader to review our discussion of » = xj(9,1) and its dilates 
and confirm that the spaces V; that we constructed above do indeed form an 
MRA. Notice in particular that, once the space Vo has been defined, then the 
other V; are completely and uniquely determined by the MRA axioms. 


i  —— —Eee 


Exercises 


1. What is a typical element of V2? What is a typical element of V_3? 

2. What is a typical element of W2? What is a typical element of W_3? 

3. Give an example of a function f such that f f(x)u(x)dx = 0 for every 
BME Vo. 

4. Give an example of a function g such that f g(x)v(x) dx = 0 for every 
vy € Wo. 

5. Verify explicitly that Vo L Wo. 

6. Verify explicitly that V; L W; for any index j. 


7. If f f(x)u(x) dx = 0 for every ps € V; for every j, then what can you say 
about f? 

8. If f f(x)v(x) dx = 0 for every v € W; for every j, then what can you say 
about f? 


rr 


9.4 Some Illustrative Examples 


In this section we give two computational examples that provide concrete 
illustrations of how the Haar wavelet expansion is better behaved—especially 
with respect to detecting local data—than the Fourier series expansion. 


EXAMPLE 9.4.1 Our first example is quick and dirty. In particular, we cheat 
a bit on the topology to make a simple and dramatic point. It is this: If we 
endeavor to approximate the Dirac delta mass 6 with a Fourier series, then the 
partial sums will always have a slowly decaying tail that extends far beyond 
the highly localized support of 6. By contrast, the partial sums of the Haar 
series for 6 localize rather nicely. We will see that the Haar series has a tail 
too, but it is small. 

Let us first examine the expansion of the Dirac mass in terms of the Haar 
basis. Properly speaking, what we have just proposed is not feasible because 
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the Dirac mass does not lie in L?. Instead let us consider, for N € N, functions 


fv = 2" x 10,1/2"): 


The functions fy each have mass 1, and the sequence {fy dx} converges, in 
the weak-* sense of measures (i.e., the weak-* topology), to the Dirac mass 0. 

First, we invite the reader to calculate the ordinary Fourier series, or 
Fourier transform, of fy (see also the calculations at the end of this exam- 
ple). Although (by the Riemann-Lebesgue lemma) the coefficients die out, 
the fact remains that any finite part of the Fourier transform, or any partial 
sum of the Fourier series, gives a rather poor approximation to fy. After all, 
any partial sum of the Fourier series is a trigonometric polynomial, and any 
trigonometric polynomial has support on the entire interval [—1, 7). In con- 
clusion, whatever the merits of the approximation to fy by the Fourier series 
partial sums, they are offset by the unwanted portion of the partial sum that 
exists off the support of fx. (For instance, if we were endeavoring to construct 
a filter to remove pops and clicks from a musical recording, then the pop or 
click (which is mathematically modeled by a Dirac mass) would be replaced 
by the tail of a trigonometric polynomial—which amounts to undesired low 
level noise (usually a hiss), as in Figure 9.6 below.) 


Figure 9.6. Undesired low-level noise. 


Now let us do some calculations with the Haar basis. Fix an integer N > 0. 
If 7 > N, then any basis element for W; will integrate to 0 on the support of 
fn—just because the basis element will be 1 half the time and —1 half the 
time on each dyadic interval of length 2~/. If instead 7 < N, then the single 
basis element y1; from W; that has support intersecting the support of fy is 
in fact constantly equal to 24/2 on the support of fy. Therefore the coefficient 
b; of 4; in the expansion of fy is 


Q-N 
65 = f fu(ouy(a) de = 2" [ 25/2 da = 25/2, 
0 


9.4. SOME ILLUSTRATIVE EXAMPLES 335 


Thus the expansion for fy is, for 0 < 2 < 27%, 


N-1 ty) N-1 
S- 29/2 145 (a) = ys g5/2 95/2 + SS g3/2 . 93/2 
j=—oo j=—o0O j=l 

= 2+(2" 2) 

= 2N 

= f(z). 


Notice here that the contribution of terms of negative index in the series— 
which corresponds to “coarse scale” behavior that is of little interest—is con- 
stantly equal to 2 (regardless of the value of N) and is relatively trivial (i.e., 
small) compared to the interesting part of the series (of size 2% — 2) that 
comes from the terms of positive index. 


If instead 2-N < 2 < 2-N+1, then pn-i(z) = —2-/? and 
bn_1pftn—1(t) = —2N-1; also 
N-2 N-2 
De bing(a) = DP 27 = 28. 
J=—00 j=-00 


Of course 6; = 0 for 7 > N. In summary, for such 2, 


S~ b;uj(«) = 0 = f(z). 


j=—00 


A similar argument shows that if 2-6 < x < 2~*+! for —oo < £< N, then 
2 bj u;(x) = 0 = f(x). And the same result holds if x < 0. 

Thus we see that the Haar basis expansion for fy converges pointwise 
to fy. More is true: the partial sums of the series give a rather nice ap- 
proximation to the function fy. Notice, for instance, that the partial sum 
Sn-1= ae n+1 0jHj has the following properties: 


(9.4.1.1) Sy_i(x) = f(a) =2%—' forO< a2 <2-%. 
(9.4.1.2) Sn_i(x) =0 for -2-% <2 <0. 
(9.4.1.3) Sy_i(a) =0 for |a| > 27%. 


It is worth noting that the partial sums of the Haar series for the Dirac 
mass 6, 


Hix (ay 2 aa), 


lgISN 


form (almost) a standard family of summability kernels as discussed in [KAT] 
(the missing feature is that each kernel integrates to 0 rather than 1); but the 
partial sums of the Fourier series for the Dirac mass 6, 


Dy(x) = > ine, 


lgISN 
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do not. Refer to Figure 9.7, which uses the software FAWAV by J. S. Walker 
([WAL]) to illustrate partial sums of both the Fourier series and the Haar 
series for the Dirac mass. 

The perceptive reader will have noticed that the Haar series does not give 
an entirely satisfactory approximation to our function fy, just because the 
partial sums each have mean-value zero (which fy most certainly does not!). 
Matters are easily remedied by using the decomposition 


=H eQw; (9.4.1.5) 
0 
instead of the decomposition 
= QW; 


that we have been using. For, with (9.4.1.5), Vo takes care of the coarse scale 
behavior all at once, and also gets the mean-value condition right. 


Figure 9.7: Partial sums of the Fourier and Haar series for the Dirac mass. 


Thus we see, in the context of a very simple example, that the partial 
sums of the Haar series for a function that closely approximates the Dirac 
mass at the origin give a more accurate and satisfying approximation to the 
function than do the partial sums of the Fourier series. To be sure, the partial 
sums of the Fourier series of each fy tend to fy, but the oscillating error 
persists no matter how high the degree of the partial sum (in the classical 
literature this is called Gibbs’s phenomenon). The situation would be similar 
if we endeavored to approximate fy by its Fourier transform. 

We close this discussion with some explicit calculations to recap the point 
that has just been made. It is easy to calculate that the j** Fourier coefficient 
of the function fy is 

“oN-1 
Fn 9) = = (#" - 1). 


wis 
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Therefore, with Sy denoting the M*" partial sum of the Fourier series, 


2 
see Se ( ) je 2™ _ 1, 


|gj|>M 


9N-1 


ju 


Imitating the proof of the integral test for convergence of series, it is now 
straightforward to see that 


C 
Ilfv — Sarllz2 © Va 
In short, || fy — Sar||z2 — 0, as M — oo, at a rate comparable to M~!/? 
that is quite slow. 
By contrast, if we let Hw = Diij<m 23/214; (where ; € Wj) then, for 
M > N —1, our earlier calculations show that 


, and 


—M-1 
Ilfv -Hullz2 = D5 2? =2-™. 


j=—00 


Therefore || fy — Haz||,2 — 0, as M — oo, at a rate comparable to 2~™/?, or 
exponentially fast. This is a strong improvement over the convergence supplied 
by classical Fourier analysis. | 


Our next example shows quite specifically that Haar series can beat 
Fourier series at their own game. Specifically, we shall approximate the func- 
tion g(x) = [cos 72] - xjo,1)(@) both by Haar series and by using the Fourier 
transform. The Haar series will win by a considerable margin. [Note: A word 
of explanation is in order here. Instead of the function g, we could consider 
h(x) = [cos 7a] - xjo,2(«). Of course the interval [0,2] is the natural support 
for a period of the trigonometric function cos ax, and the (suitably scaled) 
Fourier series of this function h is just the single term cos 7. In this special 
circumstance Fourier series is hands down the best method of approximation— 
just because the support of the function is a good fit to the function. Such a 
situation is too artificial, and not a good test of the method. A more realistic 
situation is to chop off the cosine function so that its support does not mesh 
naturally with the period of cosine. That is what the function g does. We give 
Fourier every possible chance: by approximating with the Fourier transform, 
we allow all possible frequencies, and let Fourier analysis pick those that will 
best do the job.] 


EXAMPLE 9.4.2 Consider g(x) = [cos 72] - x(0,1]() as a function on the entire 
real line. We shall compare and contrast the approximation of g by partial 
sums using the Haar basis with the approximation of g by “partial sums” of 
the Fourier transform. Much of what we do here will be traditional hand work; 
but, at propitious moments, we shall bring the computer to our aid. 
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Let us begin by looking at the Fourier transform of g. We calculate that 


an eee ne 
g(E) _— >f (ew ae, oe ers dx 
2 0 


ap 1 es —] 


2ie+m) | en) 
—e —] 
~ any 


Observe that the function g is continuous on all of R and vanishes at oo. 
The Fourier inversion formula (last section of Chapter 6) then tells us that g 
may be recovered from g by the integral 


ay f He ae. 


We study this integral by considering the limit of the integrals 


N 
nn (x) = — | GlE)e*"s dé (9.4.2.1) 


Qn Jw 


as N — +00. Elementary calculations show that (9.4.2.1) equals 


1 N co , . 
nv(2) = 5 / / g(t)et* ate de 
—N J—oco 

il ‘I 


( 
N . 
= —/ g(t) / eblt-2)§ dé dt 
2 Jo —N 
a is 4 oo 
= | a(t ae) dt 
271 Jo t-—2 g=—N 


al 
= 1 (t) 1 Cale _ ell -N)(t-2) dt 


Ont 0 : t-—2« 
1 ft tes 
= —/ g(t) 2i sin N(t — x) dt 
271 Jo t-—2 
1 f' .sinN(a-t 
= - | Qe! a 
T Jo z—t 
1 : 
—t 
= - | cos mt ENED (9.4.2.2) 
T Jo 1 


We see, by inspection of (9.4.2.2), that jy is a continuous, indeed an analytic 
function. Thus it is supported on the entire real line (not on any compact 
set). Notice further that it could not be the case that ny = O(|a|~") for 
some r > 1; if it were, then ny would be in L'(R) and then 7N would be 
continuous (which it is certainly not). It turns out (we omit the details) that 
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in fact nv = O(|2|~'). This statement says, in a quantitative way, that ny 
has a tail. 

We can rewrite the far right expression in formula (9.4.2.2) (the last item 
in our long calculation) in the form 


where 


The astute reader will realize that the kernel D n is quite similar to the Dirich- 
let kernel that arises in connection with Fourier series. A proof analogous to 
ones we considered in Chapter 6 will show that ny(a) — g(x) pointwise as 
N—o. 

Our calculations confirm that the Fourier transform of g can be “Fourier- 
inverted” (in the L? sense) back to g. But they also show that, for any par- 
ticular N > 0 large, the expression 


N * 
nn (x) = wf dee ae (9.4.2.3) 


is supported on the entire real line. (Of course this must be so since, if we 
replace the real variable « with a complex variable z, then (9.4.2.3) defines an 
entire function.) Thus, for practical applications, the convergence of ny to g 
on the support [0,1] of g is seriously offset by the fact that ny has a “tail” 
that persists no matter how large N. And the key fact is that the tail is not 
small. This feature is built in just because the function we are expanding has 
discontinuities. 

We now contrast the preceding calculation of the Fourier transform of 
the function g(x) = [cos 7a] - xj0,1)(%) with the analogous calculation using 
the Haar basis (but we shall perform these new calculations with the aid of 
a computer). The first thing that we will notice is that the only Haar basis 
elements that end up being used in the expansion of g are those basis elements 
that are supported in the interval [0,1]. For the purposes of signal processing, 
this is already a dramatic improvement. 

Figure 9.8 shows the Fourier series approximation to the function g. 
Specifically, this is a graph of the sum of 64 terms of the Fourier series created 
with Walker’s software FAWAV. Figure 9.9 shows the improved approxima- 
tion attained by 12g. Figures 9.10 and 9.11, respectively, superimpose the 
approximations 74 and nog against the graph of g. Notice that, while the 
approximations are reasonable inside—and away from the endpoints of—the 
unit interval, the “inverse” of the Fourier transform goes out of control as x 
moves left across 0 or as 2 moves right across 1. By contrast, the Haar series 
for g is quite tame and gives a good approximation. Figure 9.12 shows the 
256-term Haar series approximation—an even more dramatic improvement. 
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FIGURE 9.8 
Fourier series approximation with 64 terms to the function g. 


FIGURE 9.9 
Improved approximation with 128 terms to the function g. 
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FIGURE 9.10 
The graph of 7,23 superimposed on the graph of g. 


FIGURE 9.11 
The approximation by 1128. 
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FIGURE 9.12 
The 256-term Haar series partial sum. 


More precisely, the Haar series partial sums are supported on [0,1] (just 
like the function g) and they converge uniformly on [0,1) to g (exercise). Of 
course the Haar series is not the final solution either. It has good quantitative 
behavior, but its qualitative behavior is poor because the partial sums are 
piecewise constant (i.e., jagged) functions. We thus begin to see the desirabil- 
ity of smooth wavelets. | 


Part of the reason that wavelet sums exhibit this dramatic improvement 
over Fourier sums is that wavelets provide an “unconditional basis” for many 
standard function spaces (see [HERG, p. 233 ff.], as well as the discussion in 
the next section, for more on this idea). Briefly, the advantage that wavelets 
offer is that we can select only those wavelet basis functions whose supports 
overlap with the support of the function being approximated. This procedure 
corresponds, roughly speaking, with the operation of rearranging a series; such 
rearrangement is possible for series formed from an unconditional basis, but 
not (in general) with Fourier series. 


TT 


Exercises 


1. Calculate the Fourier series expansion of f(a”) = xj0,1)(#) - e”. Also cal- 
culate the Haar basis expansion of f. Which is a more accurate approxi- 
mation? Why? 

2. Refer to Exercise 1. Calculate the Fourier transform of f. Use Fourier 
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inversion to recover f from f. How good an approximation to you get? 
How does this compare with the results from Exercise 1? 

3. Calculate the Fourier series expansion of f(x) = x{0,1)(2). Also calculate 
the Haar basis expansion of f. Which is a more accurate approximation? 
Why? 

4. If f is continuously differentiable, then the Fourier series of f converges 
uniformly to f. Can you explain why—at least intuitively? Can you say 
something similar about the Haar basis expansion? 

5. Can you write a formula for the Nth partial sum of the Haar basis ex- 
pansion? 

6. What is the orthogonal complement of Wo in L?? 

7. What is the orthogonal complement of Vo in L?? 

8. Let X be a linear functional on Vo. Can it be represented by an integral 
formula? 

9. Let A be a linear functional on Wo. Can it be represented by an integral 
formula? 


8S 


9.5 Construction of a Wavelet Basis 


There exist examples of an MRA for which the scaling function y is smooth 
and compactly supported. It is known—for reasons connected with the uncer- 
tainty principle (Section 9.1)—that there do not exist C°° (infinitely differen- 
tiable) scaling functions which are compactly supported, or which satisfy the 
weaker condition that they decay exponentially at infinity—see [HERG, p. 
197] for a proof. But there do exist compactly supported C* (k times continu- 
ously differentiable) scaling functions for each k. In this section we will give an 
indication of I. Daubechies’s construction of such scaling functions. We begin, 
however, by first describing the properties of such a scaling function, and how 
the function might be utilized. 

So suppose that y is a scaling function that is compactly supported and 
is C*. By the axioms of an MRA, the functions {7,y}xez form a basis for Vo. 
It follows then that the functions {V/2a27,y}xez form an orthonormal basis 
for V,;. Written more explicitly, these functions have the form /2y(2x — k), 
and they span Vj. Since y € Vo C Vi, we may expand y itself in terms of the 
functions /2y(2x — k). Thus 


yp(2) = S- crv 2p(2a — k), (9.5.1) 
k 


where 


Ch = / v(x) V2 v(2x — k) da. 
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If we set 
(2) = S0(-1)*er-nV 29 (20 — k), (9.5.2) 


k 


then the functions w(x — £), @ € Z, will be orthogonal and will span Wo. To 
see the first assertion, we calculate the integral 


[ee -Bwle- Oe 
= [[cota-avFeee - 4) x [De evince da 


k L 


-- ae 2c1—Ke1-e(—1)*** i p(2a — k)- p(2a — £) daz. 
ke 


Of course the k*” integral in this last sum will be zero if k 4 & because of 
Axiom 5 of an MRA. If instead k = @, then the integral evaluates to 1/2 by 
a simple change of variable. If we mandate in advance that { |y|? = 1, then 
>>, |cx|? = 1 and the result follows. 

As for the functions w(a — 2) spanning Wo, it is slightly more convenient 
to verify that {y(@ — m)}mezU {¥ (a — £)}cez spans Vi = Vo © Wo. Since the 
functions y(2% — n) already span Vj, it is enough to express each of them as 
a linear combination of functions y(x — m) and 7(a — @). If this is to be so, 
then the coefficient an(m) of y(2x — m) will have to be 


an(m) = [ves —n)p(a — m) dx 


= [eee-m Nav3 p(2a — 2m — k) dx 


k 


= > V2 cK | p(2x — n)p(2a — 2m — k) da. 
k 


The summand can be non-zero only when n = 2m+k, that is, when k = 


n — 2m. Hence i 


An (m) = ri Cacdme 


Likewise, the coefficient b,,(m) of (2a — £) will have to be 
ban(m) = [vee — n)w(a — ) dx 
= fc —n) S0(-1)Fer_eV2p(22 — 26 — k) dx 


k 


= S- V2c1_k a; p(2a — n)p(2a — 20 — k) dx. 
k 
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Of course this integral can be non-zero only when 20+ k = n, that is, when 
k =n — 2é. Thus the ¢th coefficient is 


by (m) a = C1—n4+26- 


Thus our task reduces to showing that 


1 
= Cn—amie(& — mm) + (—1)" > Crna oxth(e — 2) 


p(2a —n) = 
If we plug (9.5.1) and (9.5.2) into this last equation, we end up with an identity 
in the functions y(2a — p), which in turn reduces to an algebraic identity on 
the coefficients. This algebraic lemma is proved in [STR, p. 546]; it is similar 
in spirit to the calculations that precede Theorem 9.3.1. We shall not provide 
the details here. 

The Haar basis, while elementary and convenient, has several shortcom- 
ings. Chief among these is the fact that each basis element is discontinuous. 
One consequence is that the Haar basis does a poor job of approximating 
continuous functions. A more profound corollary of the discontinuity is that 
the Fourier transform of a Haar wavelet dies like 1/2 at infinity, hence is not 
integrable. It is desirable to have smooth wavelets, for as we know (Chapter 
6), the Fourier transform of a smooth function dies rapidly at infinity. The 
Daubechies wavelets are important partly because they are as smooth as we 
wish; for a thorough discussion of these see [WAL] or [HERG]. 

Our discussion of the Haar wavelets (or MRA) already captures the spirit 
of wavelet analysis. In particular, it generates a complete orthonormal basis 
for L? with the property that finite sums of the basis elements give a good 
approximation (better than partial sums of Fourier series exponentials) to the 
Dirac mass 6. Since any L? function f can be written as f = f * 6, it follows 
(subject to checking that Haar wavelets interact nicely with convolution) that 
any L? function with suitable properties will have a good approximation by 
wavelet partial sums. 

The Haar wavelets are particularly effective at encoding information com- 
ing from a function that is constant on large intervals. The reason is that the 
function w integrates to zero—we say that it has “mean value zero.” Thus 
integration against ~ annihilates constants. If we want a wavelet that com- 
presses more general classes of functions, then it is natural to mandate that 
the wavelet annihilate first linear functions, then quadratic functions, and so 
forth. In other words, we typically demand that our wavelet satisfy 


[ e@e! de =o. §=0,1,2,..50—-1 (9.5.3) 


for some pre-specified positive integer L. In this circumstance we say that w 
has “DL vanishing moments”. Of course it would be helpful, although it is not 
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necessary, in achieving these vanishing moment conditions to have a wavelet 
w that is smooth. 

It is a basic fact that smooth wavelets must have vanishing moments. 
More precisely, if w is a C* wavelet such that 


Iy(x)| < C+ |al)*?, 


then it must be that 
[ eiv@) ax =o, O<j<k. 


Here is a sketch of the reason: 

Let {v%}xez be the basis generated by w for the space W;. Let 7 > 
j’. Then wi. lives on a much smaller scale than does wi. Therefore ue, is 
(essentially) a Taylor polynomial on the interval where wi. lives. Hence the 


orthogonality 
i vv}, =0 


is essentially equivalent to 
/ wi(a)a™ dx = 0 


for appropriate m. This is the vanishing-moment condition. 

It is a basic fact about calculus that a function with many vanishing mo- 
ments must oscillate a great deal. For instance, if a function f is to integrate 
to 0 against both 1 and a, then f integrates to zero against all linear func- 
tions. So f itself cannot be linear; it must be at least quadratic. That gives 
one oscillation. Likewise, if f is to integrate to 0 against 1,2,x?, then f must 
be at least cubic. That gives two oscillations. And so forth. 


Remark: It is appropriate at this point to offer an aside about why there 
cannot exist a C® wavelet with compact support. First, a C° wavelet w 
must have vanishing moments of all orders. Passing to the Fourier transform, 
we see therefore that wo vanishes to infinite order at the origin (i.e., y and 
all its derivatives vanish at 0). If ~ were compactly supported, then w would 
be analytic (see [KRA3, §2.4]); the infinite-order vanishing then forces , and 
hence w, to be identically zero. 


9.5.1 A Combinatorial Construction of the Daubechies 
Wavelets 


With these thoughts in mind, let us give the steps that explain how to use 
Daubechies’s construction to create a continuous wavelet. (Constructing a C! 
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or smoother wavelet follows the same lines, but is much more complicated.) 
We begin by nearly repeating the calculations at the beginning of this section, 
but then we add a twist. 

Imagine functions y and wv, both continuous, and satisfying 


+00 +00 +00 
/ o(e)d2=1, f joe) de=1, [ Ib(a)|2de=1. (9.5.4) 


—oo —co 7c 


We know from the MRA axioms that the function y must generate the basic 
space Vo. Moreover, we require that Vo C V;. It follows that 


o(2) = S° eV 2y(2a — j) (9.5.5) 
jEZ 


for some constants c;. The equation 


(2) = So(-lfa_jv29(2e — j) (9.5.6) 
jeZ 
defines a wavelet ~ such that {7,W} spans the subspace Wo. Notice that 
equations (9.5.5) and (9.5.6) generalize the relations that we had between y 
and w) for the Haar basis. 
Equation (9.5.5), together with the first two integrals in (9.5.4), shows 


that 
Yogev? and Se) (9.5.7) 
jeZ jeZ 
If, for specificity, we take L = 2, then equation (9.5.3) combined with (9.5.6) 
implies that 


So(-1iegp =0 and = $0 j(-1)'e; = 0. (9.5.8) 
JEZ jEZ 
In fact one can solve the equations in (9.5.7) and (9.5.8); one standard 
solution is 


peers Bea i can Eee eee (9.5.9) 


5 € — 5 ¢ eee , 
Bg OO ee OE ls aes 


and all other c; = 0. 
Now here comes the payoff. Using these values of c;, we may define 


po(x) = x>o,1)(2) 
g(x) = S > ceV 293-1 (22 —£) when j > 1. 
LeZ 


The functions yj, iteratively defined as above, converge to a continuous func- 
tion y that is supported in [0,3]. It can then be seen from (9.5.6) that the 
corresponding function w is continuous and supported in [—1, 2]. 
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Now that suitable y and w have been found, we may proceed step by 
step as we did with the construction of the Haar wavelet basis. We shall not 
provide the details, but instead refer the reader to [WAL], from which this 
particular presentation of wavelet ideas derives. 


9.5.2 The Daubechies Wavelets from the Point of View of 
Fourier Analysis 


We now give a last loving look at the Daubechies wavelet construction—this 
time from the point of view of Fourier analysis. Observe that if y is smooth 
and of compact support, then the sum in (9.5.5) must be finite. Using ideas 
from Chapter 6, we may calculate that 


nN 


(V20(2 - (« — §))) (6) = (V2a27;9) (6) = v2 41%) (E/2) = J eiit/2g(¢/2), 
3 3 


If we set 


~ LS ceité 
m(é) = 5D, ye", 


where the c; are as in (9.5.5), then we may write (applying the Fourier trans- 
form to the sum in (9.5.5)) 


P(E) = m(E/2) (6/2). (9.5.10) 


We call the function m a low-pass filter. 
Iterating this last identity yields 


P(E) = m(E/2)m(E/4)0(E/4) 


P(E) = m(E/2)m(E/4) + --m(E/2”)O(E/2”). 
Since G(0) = f{ v(x) dx = 1, we find in the limit that 


alg) = [[ mE/2”). (9.5.11) 


Now the orthonormality of the {y(a — j)} implies the identity 
Im(E)I? + Im(E +m)? = 1. (0.5.12) 


To wit, we calculate using Plancherel’s theorem (and with j,k denoting the 
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Kronecker delta) that 
bio = I playa =p de 


= = [190 2 eT td§ dé 


1 2(€+1)7r se 
= ee IA(6)[2e-W ae 
GO 2 


27 we lm 


1 Cc 20 »: 
=-—» | |G(A + 2b) |2e- 4 dd 
20 Pucaaeete st 
1 27 VY 
ge S-|G(A + 2m) |? | em ad. 
T Jo 


LEZ 


This calculation tells us that the 27-periodic function 3°, |@(u + 27)|? 
has Fourier coefficient 1 at the frequency 0 and all other Fourier coefficients 
0. In other words, 


S>|G(u + 2en)/? = 
¢ 
Using (9.5.10), we find that 


3 |O(E + Lr) |? |m(E + er) |? = 
e 


Now separating into sums over even and odd indices @, and using the 27- 
periodicity of m, yields (9.5.12). 

Running our last arguments backwards, it can be shown that if m is a 
trigonometric polynomial satisfying (9.5.12) and such that m(0) = 1, then the 
product in (9.5.11) converges uniformly on compact sets to a function @ € L?. 
Also, if @ decays sufficiently rapidly (|¢| = O(1 + |€|)~1~¢ will do), then its 
inverse Fourier transform y is the scaling function of an MRA. 

In summary, if we can find a trigonometric polynomial m satisfying 
(9.5.12) with m(0) = 1 and so that the resulting ¢ satisfies |@(g)| = 
O(1+|€|)~*-1-€ for some integer k > 0, then y will be a compactly supported 
wavelet of class C*. It should be noted that finding such a trigonometric poly- 
nomial m is hard work. 

As a final note, if yo is a “nice” function with @o(0) = 1 and if we define 
yrK inductively by 

Fe (E) = m(E/2) Raa (E/2) (9.5.13) 


then, by (9.5.11), 


K 


jim Gx () = lim Ga(€/2*) [J mE/2") = G6). 


k=1 
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This last equation defines @, and therefore y itself. Unraveling the Fourier 
transform in (9.5.13), we conclude that y = limyx, where yx is defined 
inductively by 


eK (x) = », ej V2pK-1(22 — j). 


The analysis that we have just given shows that wavelet theory is firmly 
founded on invariance properties of the Fourier transform. In other words, 
wavelet theory does not displace the classical Fourier theory; rather, it builds 
on those venerable ideas. 


Reflective Remarks 


The iterative procedure that we have used to construct the scaling function 
y has some interesting side effects. One is that y has certain self-similarity 
properties that are reminiscent of fractals. 

We summarize the very sketchy presentation of the present chapter by 
pointing out that an MRA (and its generalizations to wavelet packets and to 
the local cosine bases of Coifman and Meyer [HERG]) gives a “designer” ver- 
sion of Fourier analysis that retains many of the favorable features of classical 
Fourier analysis, but also allows the user to adapt the system to problems 
at hand. We have given a construction that is particularly well adapted to 
detecting spikes in a sound wave, and therefore is useful for denoising. Other 
wavelet constructions have proved useful in signal compression, image com- 
pression, and other engineering applications. 

In what follows are two noteworthy mathematical applications of wavelet 
theory. They have independent interest, but are also closely connected to each 
other (by way of wavelet theory) and to ideas in the rest of the book. We shall 
indicate some of these connections. One of these applications is to see that 
wavelets give a natural unconditional basis for many of the classical Banach 
spaces of analysis. The other is to see that a Calderén—Zygmund operator is 
essentially diagonal when expressed as a bi-infinite matrix with respect to a 
wavelet basis. 

We sketch some of the ideas adherent to the previous paragraph, and refer 
the reader to [DAU] for the details. 


9.5.3. Wavelets as an Unconditional Basis 


Recall that a set of vectors {e9, e1,...} in a Banach space (a complete, normed, 
linear space) X is called an unconditional basis if it has the following proper- 
ties: 


(9.5.14) For each « € X there is a unique sequence of scalars ap, a1,... such 
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that 


Co 
c= ; AFZE;, 
j=0 


in the sense that the partial sums Sy = bear aje; converge to x in the 
topology of the Banach space. 


(9.5.15) There exists a constant C’ such that, for each integer m, for each 
sequence Qo0,Q1,... of coefficients as in (9.5.14), and for any sequence 
Bo, G1,--. satisfying |G,| < |ax| for all 0 < k < m, we have 


<C>» (9.5.15.1) 


We commonly describe property (9.5.14) with the phrase “{e;} is a 
Schauder basis for X.” The practical significance of property (9.5.15) is that 
we can decide whether a given formal series Goeo + 31e1 +--- converges to an 
element y € X simply by checking the sizes of the coefficients. 

Let us consider the classical ZL? spaces on the real line. We know by con- 
struction that the wavelets form an orthonormal basis for L?(R). In particular, 
the partial sums are dense in L?. So they are also dense in L?M L?, 2 < p < o, 
in the L? topology. It follows that they are dense in L? in the L? topology for 
this range of p, since the L? norm then dominates the L? norm (the argument 
for p < 2 involves some extra tricks which we omit). Modulo some technical 
details, this says in effect that the wavelets form a Schauder basis for L?, 
1<p<o. Now let us address the “unconditional” aspect. 

It can be shown that (9.5.15) holds for all sequences {(;} if and only if 
it holds in all the special cases 3; = +-a;. Suppose that 


LP > f=> anv; 
jk 
then of course it must be that aj, = f[ f(x)vi(x) dx = (f,W)) (by the or- 


thonormality of the wavelets). So we need to show that, for any choice of 
w = {w;x} = {£1}, the operator Ty defined by 


Twf => wath, vires 


isk 


is a bounded operator on L?. We certainly know that Ty is bounded on L?, 
for 
Tw llz2 = da lesa Awe? = 2 fbb)? = WF lle. 


The LZ? boundedness will then follow from the Calderén—Zygmund theorem 
provided that we can prove suitable estimates for the integral kernel of Ty. 
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The necessary estimates are these (which should look familiar): 


C 
|k(,y)| < 
|x —y| 


and 5 Z 
ae — << ——s. 


These are proved in Lemma 8.1.5 on p. 296 of [DAU]. We shall not provide 
the details here. 


9.5.4 Wavelets and Almost Diagonalizability 


Now let us say something about the “almost diagonalizability” of Calderén— 
Zygmund operators with respect to a wavelet basis. In fact we have already 
seen an instance of this phenomenon: the kernel Kk for the operator Ty that 
we just considered must have the form 


K(2,y) = So wy rdp(a)vp(y). (9.5.16) 
jk 


Do not be confused by the double indexing! If we replace the double index 
(j,k) by the single index @, then the kernel becomes 


K(a,y) = S> werbe(x)bely) - 
t 


We see that the kernel, an instance of a singular integral kernel, is plainly 
diagonal. 

In fact it is easy to see that an operator given by a kernel that is diagonal 
with respect to a wavelet basis induces an operator that is bounded on L?. 
For let 


K(a,y) = DJ ajave(a)ey(y). 
jk 
Then the operator 
Te sf [ K(eySu)ay 


satisfies (at least at a computational level) 


Tx f(a) = Sage f, vAd}(2). 


jk 


Since {yl} forms an orthonormal basis for L?, we see that if each aj, = 1, 
then the last line is precisely f. If instead the a;,, form a bounded sequence, 
then the last displayed line represents a bounded operator on L?. In fact it 
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turns out that Tx must be (essentially) the sort of operator that is being 
described in the T(1) theorem. A few details follow: 

A translation-invariant operator T, with kernel k, is a Calderén—Zygmund 
operator if and only if it is “essentially diagonal” with respect to a wavelet ba- 
sis, in the sense that the matrix entries die off rapidly away from the diagonal 
of the matrix. To see the “only if” part of this assertion, one calculates 


Tuk wk) = ff we — Wwi@d @) dedy, 


where J is the interval that is the support of ~ and I’ is the interval that is 
the support of 7)’. One then exploits the mean-value-zero properties of yj, and 


wh, together with the estimates 


and 


After some calculation, the result is that 
where p is the hyperbolic (Poincaré) distance between the points ¢, ¢’ that are 


associated with J, I’. See Figure 9.13. We shall not provide the proofs of these 
assertions, but instead refer the reader to [MEY1], [DAU]. 


<C. ee A(S.6') 


Exercise: Calculate the matrix of the Hilbert transform 
t 
froP.v. - LO dt 
z—t 


with respect to the Haar basis. Conclude that the Hilbert transform is bounded 
on L?(R). 


Exercises 
1. Provide the details of the derivation of the formula 
g(x —n) = —= | So taaamy(@ — m) + (-1)" 50 ae (a — 8) 
meZ LeZ 


as given in the text. 
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FIGURE 9.13 
The geometry of the hyperbolic plane. 


2. Explain why a function that integrates to 0 against 1, x, x*,..., ©* must 


have a graph that oscillates at least k times. [Hint: Use induction on k.] 


3. Give an explicit example of a nonzero function that, on the interval [0, 1], 
integrates to 0 against 1, x, x”. [Hint: Think of a polynomial of degree 
at least 3.] 


4. Prove that if f is a continuous function such that 
1 . 
i; f(x)a? dx =0 for all 7 =0,1,2,... 
0 


then f =0. 
5. Show that the functions 


9; (x) = > ce 205-1 (2a — 2), 


LeZ 


as discussed in the text, converge as 7 — +00 to a continuous function y 
that is supported in [0,3]. 


6. Show that if m is a trigonometric polynomial satisfying (9.5.12) and 
such that m(0) = 1, then the product in (9.5.11) converges uniformly on 
compact sets to a function ¢ € L?. 


7. Refer to Exercise 6. Show that if @ decays sufficiently rapidly, then its 
inverse Fourier transform y is the scaling function of an MRA. 


8. Show that the Hilbert transform is bounded on L? by calculating the 
Fourier transform of the kernel 1/z. 


9. Refer to the calculations at the end of the section. Show that 
(Tw, wz.) < C . eT PLS SS ) ; 


where p is the hyperbolic metric. 
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10. Which nonnegative integers k have the property that 


/ (sin x)a" dx = 0? 


(I 
9.6 The Wavelet Transform 


The material presented up until now in this chapter should be considered to 
be an informal introduction to what wavelet theory is all about. Now we shall 
engage in some detailed calculations in order to turn wavelets into a useful 
tool. We shall conclude with some concrete applications. 

We now develop a sequence of results that lead up to a powerful tool called 
the wavelet transform. Afterward we present some compelling applications. 
Recall the basic function y that we have treated in detail above. 


Proposition 9.6.1 Define 


a(t) = 29/2. p(t —k for j,k EZ. 
Yj, 


Then the {y;,,} form an orthonormal basis for V;. 


Proof: The assertion is obvious from the discussion in Section 9.3. Oo 


Definition 9.6.2 Let 7,k € Z. Define the interval 


k k+l 


We refer to the first index j as the level of the interval. The level specifies the 
size of the interval. 


Proposition 9.6.3 Let 7 € Z. Then 


R=---U,-2UT,-1UL;,0 U1 UG 2U:::. 


Proof: Obvious. O 
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Proposition 9.6.4 Let j,k,m € Z and assume that k # 
m. Then 


Lik NM Ye — ) . 


Proof: Obvious. O 


Proposition 9.6.5 Let j,k,@,m © Z. Assume that € > 7. 
Then either Ij, em =9 or Igm C Ij,x- In the latter case, 


I¢m is contained in either the left half of I;,, or the right 
half of I;,x. 


Proof: Obvious. Oo 


Now we want to consider the projection of a function g € L?(R) into Vj. 
Denote this projection by P;[g](t). Just as in finite-dimensional linear algebra, 


we have that 
Pylg\(t) = >> (jul), 9(8)) - 9 j,e(t)- 


keZ 
We can use Proposition 9.6.1 above to rewrite this last formula as 
Pilgl(t) = SY \(24/? p(2%t — k), g(t) - 24/2 p(2%t — k) 
keZ 
= 27.9 1(y(2"t — k), g(t) - (2%t — k). 
keZ 


EXAMPLE 9.6.6 Let g(t) = e7!*!. Let us calculate P[g](t) and P_,[g](t). 
Now 
Po{g](t) = 4 > (p(4t — k), g(t) - p(4t — k). 
keZ 
We see that 
supp (2,k) = F st 
, 4’? 4 
Hence the inner produce can be written as 
TD Ae-tde iF b> 0 
(p(4t—k), g(t)) = 
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As a result, 


-1 


Pp[g](t) = 4(1 — e*) > e*/4o(4t — kb) + 4(e/4—1) So e*/4y(4t — bk). 
k=0 k=—0o 


keZ 


We know that 
supp (~-1,k) = [2k, 2(k + 1)] 


and hence y_1,x equals 0 outside that interval. The inner product is then 


2k+2 —t 7 
dt if k>0 
t/2—k),g(t)) = ee = 
(o(t/2 -), a(t) ae ee 


= e*(1—e-?) if k>0 
= e2*(e? — 1) if k<0O. 


Thus we can write the projection as 


POE (1 a) Srey (5 = k) Fe 5(e =F s Py (5 S k) 


k=0 k=—0o 
| 
Proposition 9.6.7 The V; spaces are nested as follows: 
CV 2gCV1CUWYCVWCWwC::: 
Proof: This is immediate from the definition of V;. Oo 


Proposition 9.6.8 A function f € V; if and only if f(2t) 


is an element of Vj+1. 


Proof: This is immediate from the definitions. Oo 
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Proposition 9.6.9 The spaces V; satisfy these properties: 


() Vj = {0} 


jeZ 


Uv = DPR). 
jEZ 


Proof: Already discussed in Section 9.3. Oo 


Proposition 9.6.10 Let fi © Vi be given by 


filt) = S- an p1,k(t) - 


keZ 


Then the projection Po[fi|(t) of fi into Vo is given by 


fo(t) = D> be(t — k), 


keZ 


2 
bh = ca (daz + Gor41)- 


Proof: Write 
fo(t) = Polfil(t) = S> belt — k). 


keZ 
By the definition of the projection we have 


by = (v(t — 8), A(t) = [ y(t — k) A(t) dt. 


We know that supp y(t — &) = [k,& +1] and, on this interval, the value of 
the function is 1. Hence the last integral reduces to 


k+1 
n= | filt) dt. 
k 


The function f; is piecewise constant with possible discontinuities at 
points of (1/2)Z. Notice that there are two relevant basis functions: 


Y1,m(t) = V2¢p(2t > m) with SUPP Y1,m = [k, k a 1/2] 
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and 
Yiym+r(t) = V2p(2t — (m + 1) with supp Y1,m41 = [k+1/2,k4+ 1). 


We need to determine the correct choice for m. 

Observe that the compact support of y(2t — m) is [m/2, (m+ 1)/2]. We 
need the left endpoint of this interval to equal the left endpoint of the support 
of p(t—k). Thus m/2 = k or m = 2k. As a result, yy 94(t) = p(2t — 2k) is the 
leftmost function whose support overlaps [k, k +1]. We find the other function 
by translating this last function to the right by 1/2 unit. This is 


p(2(t — 1/2) — 2k) = y(2t — (2k + 1)) = vire+1- 
We conclude that, on the interval [k,k + 1], 
fi(t) = aopy(2t a 2k) + Gon419(2t an (2k + 1)) é 


Here supp y(2t— 2k) = [k,k+1/2] and supp p(2t— (2k+1)) = [k+1/2,k+1]. 
The inner product thus becomes 


k+1 
be = / fil(t) dt 
k 


k+1 
= / (cxvBec2 —2k)+ dop41V 2y(2t —(2k+ 1)) dt 
k 


k+1 


k+1 
v3 | agp yp(2t _ 2k) dt + v3 | aon+19(2t _ (2k + 1)) dt. 
k k 


l| 


Of course y(2t—2k) = 1 on the interval [k, k+1/2] and y(2t—(2k+1)) =1 
on the interval [/ + 1/2,k+ 1]. The inner product then becomes 


k+1/2 k+1 
bh = v3 | aorp(2t _ 2k) dt + V2 agrn+iy(2t _ (2k + 1)) dt 
0 k+1/2 
k+1/2 k+1 
= vBaxx | ldt+ V2a2k41 | 1dt 
0 k+1/2 
2 
= “a (a2k + G2k41) - 
That completes the proof. oO 


The last proposition is an important calculation that will shape what we 
do below. 


Exercise for the Reader: Let 


fit) = 5¢1,0(t) + 3¢1,1(t) + 5y1,2(t) — 1,3(t) + 5¢1,4(t) + 7¢1,5(t) . (9.6.11) 
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Calculate that the projection fo of f; into Vo is 


fo = 4V/20,0(t) + 2V/20,1 (t) + 6V2y0,2(t) ‘ 


We have constructed fo as an approximation in Vo of f; € Vi. Let us now 
change our point of view and assume that we are given fo € Vo and fi € Vi 
and we want to produce a residual function go so that fo(t) + go(t) = fi(t). 

Let us find a way to describe go explicitly. 

Of course go = fi — fo. Since we know exactly what the functions on the 
right look like, we see that go is a piecewise constant function with disconti- 
nuities at 1/2,1,3/2,2,5/2. We conclude that go € Vi so go can be written as 
a linear combination of the 91 x. 

Refer to the Exercise for the Reader above. Let us focus on those par- 
ticular fo, fi. Since supp fo = supp fi = [0,3], we need only consider y,,, for 
k =0,1,2,...,5. 

Consider now fo and f; on the interval [0, 1). On this interval fo equals the 
average of the two constant values of f1. Hence go equals the directed distance 
from fo to fi on this interval. That directed distance is 1/2 on [0, 1/2) and it 
is —/2 on [1/2,1). 


We can repeat this last analysis on the intervals [1, 2) and [2,3). We obtain 
go(t) = f(t) — fot) 
= V2y(2t) — V2y(2t — 1) + 3V2¢(2t — 2) — 3V2¢y(2t — 3) 
— /2p(2t — 4) + V2y(2t — 5) 
= V2(p(2t) — p(2t — 1)) + 3V2(p(2t — 2) — y(2t — 3) 
— J2(p(2t — 4) — p(2t —5)). (9.6.12) 


Now we want to take a closer look at this function go. Let us restrict 
attention to go on the interval [0,1]. On that interval, go is V2 times the 
function w, where 


1 if O<t<3 
w(t) = y(2t) —p(2t-1)= 4 -1 if 4<t<1 (9.6.13) 
0 if t<Oorl<t<o. 


Next let us look at go on the interval [1,2). We can view this function as 
the one-unit right-translate of the w discussed in the last paragraph multiplied 
by 3V2. As a result, on [1,2), go(t) = 3V2u(t — 1). Similar reasoning shows 
that, on the interval [2,3), go(t) = —wW(t — 2). We conclude that 


go(t) = V2eb(t) + 3V2y(t — 1) — V2v(t - 2). 


We see now that there is a relationship between the coefficients of ~(t—k), 
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k =0,1,2,... in the discussion following (9.6.13) and the coefficients of f; in 
(9.6.11). If we write 


go(t) = S~ cxe(t — ) 
k=0 


where co = V2, c, = 3V2, and cp = —V/2, then the cx, obey the relation 


V2 
Ck = ey ; (Gor = Q2k41) - 


Based on this experimental evidence, we now have the following more 
general result. 


Proposition 9.6.14 Suppose that the function f;, an ele- 
ment of Vi, is given by 


fit) = So angi s(t). 
keZ 


Further assume that fo is the projection of Po[fi|(t) of fi 
into Vo. If go(t) = fi(t) — fo(t) is the residual function in 
V,, then go is given by 


go(t) = ‘2 cev(t — k) 


keZ 


where ~ is given by (9.6.13) and 


2 
Ch = “> (a2k — a2k+1) 


Proof: Exercise. O 


In the past we have been interested in the linear span of y and its integer 
translates. That is the space Vo. In the last proposition we saw that there is 
interest in the linear span of w defined in (9.6.13) and its integer translates. 
This gives rise to the space Wo (which we have already had a glimpse of in 
Section 9.3). Thus we have 


Definition 9.6.15 Let ~ be as above. We define 
Wo = span {..., H(¢+1), ¥(t), o(t-1),... }}L7(R) = span {4)(t—k) }nezML?(R). 


We call Wo the Haar wavelet space generated by the Haar wavelet function w. 
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The etymology of the word “wavelet” is now evident. Because of the prop- 
erties of the approximation fo to f; (averaging consecutive values a2, and 
a2r41), it makes sense to model the error using small waves such as ~ and its 
integer translates. 


Proposition 9.6.16 The set {w(t — k)}nez forms an or- 


thonormal basis for Wo. 


Equation (9.6.10.1) tells us that the vector 


ve. | (9.6.17) 


t 
h = "[ho, hi] = 29 


has an important role in projecting a function f; € V, into Vo. In fact we 
can rewrite the projection coefficients in (9.6.10.1) using formula (9.6.17). For 
k € Z we have 

b, =h-a*, 


where 


a =" 


2k; a2k+1] : 


We can use h to rewrite y in terms of basis functions of V,. To wit, 
p(t) = p(2t) + p(2t — 1) 


= (4) V2y(2t) + (3) V2(2t — 1) 


= hovi,o(t) + higi s(t) . (9.6.18) 


In a similar manner, we can use the vector 


= ‘[g0,91] = ad (9.6.19) 
to project a function fi; € Vi into Wo. As we see from (9.6.13), 
V(t) = y(2t) — y(2t — 1) 
= (2) V2y(2t) + (-4) V2p(2t — 1) 
= goi,o(t) + 91,1 (t) - (9.6.20) 


Equations (9.6.18) and (9.6.20) will play a major role in our development 
of wavelet theory. 
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Definition 9.6.21 We shall call the equations 


2) = Porolt) + Geral) = 92s) + 9@t-1) 
and We Wo} 
v(t) = Foro(t) — Port) = (2) — o(2t-1) 


dilation equations. The Haar function y which is used to generate these dila- 
tion equations is usually called a scaling function. 


The vectors h and g in equations (9.6.17) and (9.6.19) play a significant 
role in applications involving discrete data. In fact we shall see that h and 
g can be used to form a matrix that allows us to decompose a signal (or a 
digital image) into an approximation of the original signal together with the 
details needed to recover the original signal from that approximation. 

It is straightforward for you to check that (y(t—k), a(t —m)) = 0 for any 
integers k and m. The next proposition then follows immediately. 


Proposition 9.6.22 Suppose that f € Vo and g € Wy. 


Then (f,g) =0. 


Definition 9.6.23 Let V and W be linear subspaces of L?(IR). We say that 
V and W are orthogonal or mutually perpendicular if and only if (f,g) = 0 for 
all f € V and g € W. We write V LW. 


Definition 9.6.24 Let V and W be linear subspaces of L?(IR). We define the 
direct sum of V and W to be the space 


Veow={fit)+gqt):feVigEew}. 


If it happens that V is orthogonal to W, then we call this the direct orthogonal 
sum. 


We can think of V as a subspace of V 6 W via the mapping v  v+0 
and similarly for W. 

Since Vo L Wo, we can consider the direct orthogonal sum Vo 6 Wo. We 
have the following. 


Proposition 9.6.25 Suppose that Vo, Vi, and Wo are de- 
fined as usual. Then 


Vi=VodW). 


Proof: We know that a function f € Vi; can be written as the sum of an 
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fo € Vo and a go € Wo. Since fp and go are both elements of V,, we thus see 
that V; = Vo @ Wp. To finish the proof we need to show that if f, € V; and if 
fi is orthogonal to all the elements of Vo, then fi € Wo. 

Write fi = fotgo, with fo € Vo and go € Wo. Let h € Vo be an arbitrary 
element. Then 


0 = (f1,h) 
= (fo, h) + (go, h) 
= (fo,h) +0 
= (fo,h). 


We conclude that fo = 0. Therefore fi = go € Wo. That proves the result. 0 


We now have a well-established connection among Vo, Vi, and Wo. We 
would like to have a similar result at the jth level. So, given a function f; € V;, 
we wish to approximate it with a function f;—1 from Vj;-1. We could certainly 
do so by projection, but we need a way to measure the difference. Since V; 
is constructed from functions y(2/t — k), it makes sense that the difference 
function will be constructed from translates and dilates of the function w. 
Thus we have the following definition. 


Definition 9.6.26 Let w be the Haar wavelet function. For j,k € Z we define 


W;.n(t) = 29/? - w(25t— k). 


Proposition 9.6.27 We have that 


IIvsxll = 1. 


Proof: This is a straightforward calculation. oO 


It is also easy to see that [ w;,, dt = 0. 
EXAMPLE 9.6.28 Let us describe the support of each of these functions: 


(a) Y1,-2, 
(b) W_2,4; 
(c) w5,-5. 
For part (a) we see that 
1,9 = 2M p(2t + 2) = V2-p(2(t+1)). 


We conclude then that we are contracting the function w by a factor of 2 and 
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then shifting one unit to the left. Now suppw = [0,1]. Thus we have that 
supp #(2(¢ + 1)) = [-1,-1/2]. 

A similar calculation shows that supp q_2,4 = [16, 20]. 

Finally, we can calculate that supp q4,-5 = [—5/16, —1/4]. |_| 


Recall that 


Qi/2 if O<2@t—-k<3 
Wet) = 4 27/2 if <2t-k<1 
0 if be Ook tS... 
And this equals 
25/2 if 25k <t<2%R42-040 
25/2 if 2-5 4 2-G4+) <t <2-5p 4 2-* 
0 if t<2JIk ort >2%9k+2-4, 


As a result we have the following. 


Proposition 9.6.29 The compact support for the function 


w;,~ is the interval [2~Jk,2~9(k + 1)]. 


Proof: From the discussion preceding the statement of the proposition we 
see that a, takes the value 2//? on the interval [2~7k,2-Ik + 2-9+)] and 
takes the value 2-4/2 on the interval [27k + 2-9+) 2-Jk +274]. It is zero 
elsewhere. oO 


Definition 9.6.30 Let w be the usual Haar function. We define the vector 
space 


W; = span { .., (Qt + 1), W(27t), p(2t — 1),... \ nN L?(R) 


= span {wer — i} NL?((R)). 


keZ 


We term this space the Haar wavelet space W;. 


Proposition 9.6.31 Let W; be as in the preceding defi- 
nition. Then the set {;,x4}nez is an orthonormal basis for 


W;. 


Proof: The proof is similar to that for Proposition 9.6.1. oO 
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So we certainly know that 7; ;, is orthogonal to w;, for k # m. The next 
result gives even more orthogonality properties. 


Proposition 9.6.32 Let j,k,@,m be integers with € > j. 


Then (j,k, Wem) = 0. 


Proof: First we treat the case (= j. We see that y;,,(t) = 2//* on Ij, and 
Wjm(t) = £29/? on Ijm. If k # m, then we know that Ij~9 Ijm = 0. So 
the supports of the two functions are disjoint hence the inner product is 0. If 
instead k = m, then 


(ieotse) =f vsulQdyalt) a 


= ey Wj,4 dt 
Ij, 
=~ i; 


Now consider the case that @ > j. First note that ~;,, and qWem are 
nonzero on J;,, and Ig¢m respectively. There are two cases: 


(a) If Jj 49 Lem =, then the inner product is automatically 0. 


(b) If instead Ij,4.A Lem #9, then Ip is either entirely contained in 
the left half of J;,, or entirely contained in the right half of I;,x. 
Since ;,x)(t) = 23/2 on Ij,x, it therefore equals 25/2 on Ip m. Hence 


(05,4(t), Dem) = 24/2 Wem(t) dt =0. 


Lejm 


That completes the proof. Oo 


A related idea is the following. 


Proposition 9.6.33 Let 7, be integers with € > 7. Then 


V; L We. 


Proof: Let f € V; and g € We. Since {y;,x} is a basis for V; and {Wem} isa 
basis for We, we can write 


F(t) = So eeyz et) and g(t) = S> dmaem(t). 


keZ meZ 
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Now we calculate 
(f(),9() = [ f(t)g(t) at 
- [Seva SO dmibem(t) 


keEZ meZ 
= Yad de f viel bet) a 
keZ meZ 


But the preceding proposition tells us that each of these last integrals is 0. So 
f and g are orthogonal, as was to be proved. Oo 


Next we have a strengthening of Proposition 9.6.27. 


Proposition 9.6.34 Let j,k,?,m be integers. Then 


1 if g=landk=m, 


(Wj,4(t), Vem) = { 0 if j#LlorkAm. 


Proof: We first treat the case 7 = and k = m. Then we calculate that 
(jal, bem(t)) = f vale) at = 2?2e(ate — I? = 24 vee — AYP. 


But of course ||7)(2/t — k)||? = 2-7. So this establishes the first half of our 
assertion. 
Now if k 4 m, then there are two cases. 


(a) If Jj = @, then the inner product is zero because the functions {j,i} 
form an orthonormal basis of W;. 


(b) Say that 7 4 @. We may as well suppose that ¢ > 7. Then Ip is 
either entirely contained in the left half of J;,, or else it is entirely 
contained in the right half of Jj. If it is in the left half, then 
W;,n(t) = +23/2, But then 


(je (t), Wem(t)) = 2257 fem dt. 


We know that this last integral equals 0. 
The case k = m and j ¥ £ is handled similarly. oO 


368 CHAPTER 9: WAVELETS 


Proposition 9.6.35 Let j,¢ € Z with 7 4 @. Then W; 1 


We. 


Proof: Exercise. O 


Proposition 9.6.36 The function jo satisfies the dilation 
equation 


v2 v2 


~3,0(t) = > - pj41,0(t) + ae yjtii(t).  (9.6.36.1) 


Proof: We know that 
p(t) = p(2t) + p(2t — 1). 
We replace t by 2/t in this equation to arrive at 
y(2%t) = p(2I*"t) + y(t — 1). 
Let us multiply this last equation by 2/ /2 We obtain 
2H 9(2It) = pyo(t) = 2/2 p(2it Ms) + 29/2(2/*1t — 1), 
Now we rewrite the very last equality as 
y;0(t) = 2-1/2 91/2, 9/2 y(Q+14) 4. 9-1/2. 1/2. 95/2 y(2/+14 — 1) 


=—97-1/2, Q041)/2 6(25+14) 4-1/2, Q041)/2 ,(25+14 a) 


v2 V2 
= > Pi+i0(t) + > Pitta (t) Oo 


a 


Exercises 
1. Let g(t) = t?. Calculate P2[g](t) and P_i[g](t). 


9.7. MORE ON THE WAVELET TRANSFORM 369 
2. Let 


fit) = 3¢91,0(t) — 1,1 (t) + 2¢1,2(t) — 4¢1,3(t) + 6p1,4(t) + 2¢1,5(8) - 


Calculate that the projection fo of f1 into Vo 
3. Describe the support of each of these functions: 

(a) ¥-23 

(b) qa,-1 

(c) ws,-3 
4. Describe the Haar wavelet space W; in words. 
5. What does the formula 


eas g-1/2  ol/2 , 27/2.6(29+14) + g-V/2 91/2 , 27/2 (29414 iy 
= 9-1/2, QUtD/2g(g5+14) 4 9 1/2. Qt0)/2 p(gitte — 1) 


J2 J2 
= > ¥5+1,0(t) + > Pi+11(t) 


have to do with the vector h? 


6. Thespaces W; are mutually orthogonal but the spaces V; are not. Explain 
the significance of this fact. 


7. Explain why the moment conditions f ~(x)2’ dx = 0 arise and why they 
are significant. 


8. What particular properties set the Daubechies wavelet apart from the 
Haar wavelet? 


9. Is it important that there is no C° Daubechies wavelet? Can you use 
physical reasoning? 

10. Explain why the space W; is needed to complete the space V; to Vj+1. 
What is V; lacking? 


a 


9.7 More on the Wavelet Transform 


The next result generalizes the last proposition in the last section. 
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Proposition 9.7.1 For j,k € Z we have 


V2 


J2 
Pie(t) = = + P5-41,0K(t) + + 5 41,2K41(8) - 


2 2 


Proof: We know that 


viet) = 29/2 p(2t — k) 


= #3(¢(-4) 


Now we shall replace t by t — k/2) in equation (9.6.36.1) to obtain 

k; v2 RN. Ae k; 

Ep PO or gg or ge ee baa 
(9.7.1.1) 


~5,n(t) = 95,0 (« - 


The next step is to expand the term ;+1,0(t — k/2’): 


k 
H)/24 ( oit1 (4 _ © 
ev" (5) 


j+1 
= +D/2y (2 aS *) 


| 
N) 
= 


yj+i0(t — k/2?) 


yy 


= 2040/25 (23+44 — 2k) 
05 +1,2k() - 


l| 


Expanding the term 9j41,1(t — k/2’) in a similar fashion yields 


gjriilt—k/24) = aGtV/2y Ga (« = =) = i) 
: ‘ 9I+1p 
— 9(5+1)/2 Stl, = 
= 2 Y (2 [A 1) 
= 2070/29 (25414 — 2k — 1) 
= 20+1)/2y (25+1¢ — (2k + 1) 


Pj+1,2k+1(t) 


Now substituting these last two calculations into (9.7.1.1) gives the de- 
sired result. oO 
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Next we have some analogous results for ~. The proofs of these results 
are left as an exercise. 


Proposition 9.7.2 Let j,k € Z. Then 


W5,K(t) = se * Pj41,2K(t) — a j+1,2k-+41(t) - 


2 


In particular, or k = 0, we have 


V;,0(t) = se - pi41,0(8) — _ - pj+1,1(t) - 


We have formerly considered projecting functions from V; into Vo. Now 
we generalize these ideas to projecting functions from V;+1 into V;. 


Proposition 9.7.3 Let fj41 € Vj41 be defined by 


fisi(t) = y: am ?j,m(E) - 
meZ 
Further assume that h is the vector defined in (9.6.17). Then 
the projection vector f; = P;|f;41](t) of fj+1 into V; is given 
by 


Flt) = Do beg r(t) = Sofi), Gin) ei) » 


keZ keZ 


where b;, is given by 


“9. (dor + @2e41) =h-a". 


Here we have a® = "lax, a2n41]. 


Proof: We write 
be = (fy41 (4), 97,46) 


= ~ AmYj+1,m(t), eat 


meZ 


— ye am (Qj41,m(t), p5,K(t)) : 


meZ 
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We now use the dilation equation for y;, to write 


be = D> am(yj41,m(t), 95,4(t)) 


meZ 


= SS ae | Ojzienlt); M2 ase a V2 pects 
2 2 


meZ 


v2 v2 
=F Damm (Gtr (t), Pi+1.26(f)) + FD am (e741 (Es P7-+1,2841 (1). 
meZ meZ 


The functions ¥j+41,m(t) and ~j+1,2«(t) in the inner product in the third line 
above are elements of an orthonormal basis for V;,,. Thus the only nonzero 
inner product occurs when m = 2k. Thus the first term in that line reduces to 
(/2/2)-ag,. Similar reasoning shows that the second term in that line reduces 
to (2/2) - d2z41. That ends the proof. oO 


Next let us think about projecting an element of V;+1 into W;. 


Proposition 9.7.4 Assume that fj41 € Vj41 is given by 


fia) => ane: 


meZ 


Further suppose that f; = P;(fj;+1](t) is the projection of 
fj+1 into V;. Let g be given by (9.6.19). If g;(t) = fj4i(t) — 
f;(t) is the error term in V;41, then g; € W; and is given 
by 


g(t) = D> ends n(t)- 


keZ 


Furthermore, 


v2 — 
Ck = wp (42k — 2n41) = g:a. 


Here ak = “laak, a2p+1]- 


Proof: This proof hinges on the compact support properties of yj,, and 
Yj+1,k- We shall analyze f;,1 — f; on an interval-by-interval basis. 

The basis elements yj, that are used to build f; are nonzero on the in- 
tervals Ij, = [k/27,(k + 1)/27). Such an interval has length 2-4. The basis 
elements Yj+1,, that are used to build f;,, are nonzero on intervals that are 
half as long. If we are going to analyze f;,, — f;, then we must consider the 
larger intervals [k/2),(k+1)/27) and we also must consider the two subinter- 
vals [k/27,(k + 1/2) /27) and [(k + 1/2)/2?, (k + 1)/2?). 
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We next must identify the corresponding basis functions. The basis func- 
tions for the interval [k/2/,(k + 1/2)/2%) will assume the value 24+1)/? for t 
satisfying 


k = k+ 1/2 


opr os 
Let us multiply these inequalities through by 2/ to obtain 


; 1 


Now multiply this new string of inequalities through by 2 and then subtrace 
2k from each term. The result is 


0<2+1¢-Ok <1. 


It is now apparent that the basis function we seek is ~j+1,2%- In a similar 
manner we may see that the basis function that equals 20+")/? on the interval 
[((k + 1/2)/2,(k+1)/2) is Yj41,2e41. We also know that y;, takes the value 

95/2../9 ; 

ae ‘(axa = 20-7 ( 
on the interval [k/27,(k + 1)/27). In conclusion, on the left half-interval 
[k/27,(k + 1/2)/2%), the function f;+1 — f; has the value 


2k + A2k41) (9.7.4.2) 


DOTY Pg = 20-D/2 (ay, + d2n41) = 25/7 a5,(2'/? = oti) = 2O-D/2 gory 
= 20-D/? (ay, — aee41)- 


By contrast, on the right half-interval [(k + 1/2)/23, (k+1)/2/), the func- 
tion fj;41 — f; assumes the value 


BO? Gopi = 20-D/2 (ao, + G2r41) = 25/2 aon44(2'/? = got) = 90 UP ao 


= —20-)/2(a5, —@on41). (9.7.4.3) 


Recall that supp Wj, = [k/2’, (k + 1)/2/]. Write 
25/2 if k/2) <t < (k+1/2)/29 
. — 93/2 jp pp) — ; = ; 
Re ete) { 25/2 if (k-41/2)/27 <b < (RE 1/2. 


Finally, we can use (9.7.4.2) and (9.7.4.3) to summarize our findings on 
(k;/2, (K+ 1)/2). We have 


fizi) — £6) 
_ f 20-D/? (aan — arns1 if k/27 <t < (k+1/2)/2! 
~ L -20-DP (aon —aone1 if (R+:1/2)/27 <b < (bh +1)/23 


the alates 95/2 if k/25<t < (k+1/2)/23 
Qe Ak+1)) _os/2 if (k-41/2)/29<t < (k+1)/2) 


, (a2k = d2n41)W;,4(t) ¢ 


“le lS 
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That completes the proof. oO 


It is an easily verified fact that g;, as defined in this last proposition, 
is the projection of fj;41 into Wj. It follows that any function fj41 € Vj41 
can be written as the sum of an element f; € V; and an element g; € W;. 
Furthermore, f; is the projection of f;,, into V; and g; is the projection of 
fj41 into W;. As the next proposition shows, V;;; can be constructed from 
V; and W;. 


Proposition 9.7.5 Suppose that V;, W;, and V;+1 are as 
usual. Then 


Vj41 = Vj BW; . 


Proof: Similar to the proof of Proposition 9.6.25. oO 


9.7.1 Summary 


We now know that every f;+1 € Vj41 can be written as the sum of an approz- 
imation function f; € V; and a residual function g; € W;. The functions f; 
and g; are orthogonal to each other. Also, if 


fisr(t) = So devysie(t), 


keZ 


f(t) = Do onvjrra(t), 


keZ 
and 


Also, for k € Z, 


2° 2 
and 
k _t 
a” ="[aer, a2n41], 
we have 
b, = h-a*® and Ch =g-a*. 


The very last two formulas will be the basis for creating a discrete Haar wavelet 
transformation that can be used to process digital signals and images. 
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a 


Exercises 
1. Show that fp Wem(t) li _, Pem(t) dt = 0. 
2. Plot each of these functions. ‘Give a verbal description of the compact 


support of each function. 


(a) w2,-4(t) 

(b) -3,5(t) 

(c) ya,s(t) 

Plot each of these functions. 

(a) f(t) = 2u(¢/8 — 2) — 4u(t/8 + 1) + 3¥(¢/8-+1) 

(b) g(t) =D. 5- Y(l6e +3) 

(c) h(t) = Dj--ssin(ws)v(t/2 + J) 

(d) k(t) = ote, W(2t + 5)) - P(2t + 9) 

We know that V; C Vj+1 for each j. Is something similar true for the 
W;? 

Let j be an integer and let W; be as usual. Prove that the set {(2t + 
k)}nez is a linearly independent set in Wj. 


Use the u-substitution from calculus to establish that 

(a) (b(2’t+ Jj), p(2’t+k)) =0 for j#k 

(b) |jw(2’t + &)|| =2-7? 

Show that the error term in Proposition 9.7.4 is simply the projection of 
fj4+1 into W;. 

Prove Proposition 9.7.2. 

Prove Proposition 9.7.5. 


aS 


9.8 


Decomposition and Its Obverse 


In this section we learn how to iterate the decomposition process described in 
the last section. The key is repeated use of the last proposition. 
As an illustration, consider the space V;. We can write 


Vs = Vs © Wi. 


But V4 = V3 6 W3 hence 


Vs = V3 BW3 GW. 
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Continuing this reasoning, we can write 


V5 = VadBW4 
= V30W3 0 W4 


= YWEOWEW, CW2EW30W,. 


EXAMPLE 9.8.1 Suppose that fs € V3 is given by 


7 
fs) = S an¢s,n(t) 
k=0 


= 3y3,0(t) + y3,1(t) — 23,2(t) + 4¢3,3(t) + 5y3,4(t) + Y3,5(t) 
—2¢93,6(t) — 4y3,7(t) - 


Recall that y3,4(t) = 23/2,9(8t — k). Hence the coefficients are each multi- 
plied by 2/2. Our goal here is to decompose f3 into an approximation function 
fo € Vo and details functions go € Wo, g1 © Wi, go € Wa. 

We know that V3 = Vo @ Wo. Hence fs = fo+ go with fo € V2 and 
g2 © W2. Also fo is the projection of f3 into V2 and gz is the projection of fs 
into W2. Thus we use Proposition 9.7.3 to obtain fz and we use Proposition 
9.7.4 to obtain gz. Since only ao, a1,...,a@7 are nonzero in our expression for 
fs, formula (9.7.3.1) tells us that the only nonzero coefficients in 


fo(t) = So beyon(t) 


bez 
are 
bo = V2 (ag + a1) = Y(8 + 1) =2V7, 
b= Vag + aa) = WA-2+4) = V3, 
by = VP (as + a5) = Y(5 +1) =3V3, 
bg = Vag +07) = 2(-2- 4) = 33, 


In a similar fashion, we can use (9.7.4.1) to calculate the nonzero coeffi- 
cients c;,, for 


g2(t) = >) ced2,e(t) 


keZ 
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We obtain 
co = V2 ag — a1) = Y2G-1) = v2, 
c= VEO eae SR 
eS VP (ag — a5) = Y(5 — 1) = 2V3, 
C3 = V2 (ag — a7) = (24.4) = V3, 


Next we use Propositions 9.7.4 and 9.7.5 to create f; € V; and gi € Wi 
from fo € V2. The nonzero coefficients for f; are 


tp = avis v3) =3 and , = Pav —3v3) =o. 
The nonzero coefficients for g; are 
oo = Zeov3—- V8) =1 and a = 2 (av3- (-3v3)) <6. 
At last we project f; € Vi into Vo and Wo. We have 
and a = 28-0) = 5 v3. 


Hence 


We have decomposed fs as follows: 


f(t) = fo(t) + go(t) + gi(t) + get). al 


In applications it is often useful to begin with a function fj+z, € Vj+r 
and then perform L decompositions so that we can write 


fjtn = fy +95 + 941 +++ + 954L-1- 


In other circumstances we might be given the decomposition (on the right) 
and be asked to recover f;+,. We now develop some formulas for performing 
these operations. 
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Proposition 9.8.2 Let 


fist) = Do anyyti a(t). 


keZ 


Assume that 


filt)= So depja(t) and — gj (t) = D> cavs a(t) 


keZ keZ 


are the projections of fj41 into V; and W; respectively. Then 


v2 V2 
ao. = > (outer) =hd* and agai = “> (bn —cx) =g-d*. 


Here h, g are as usual and d* = *[bp, cx). 


Proof: Suppose that f; € V; and g; € W; are obtained by projecting f;41 € 
Vj+41 into V; and W; respectively. Propositions 9.7.3 and 9.7.4 tell us that we 
can write fj41 = >, @kYj+1,k as 


fia) = S- be G;,n(t) + CrP; n(t) (9.8.2.1) 


keZ 


In order to reconstruct f;,; from f; and g;, we need to be able to write the 
dy in terms of the 6; and the cz. 
The two indicated propositions tell us that 


2 
br — “a (2k + A2k+41) and Ch = “a (ae a a2k+1) : 


Let us look at a single term from (9.8.2.1). Both y;,, and w;,, are nonzero on 
[k/23,(k + 1)/2?). Indeed, on the half-interval [k/27,(k + 1/2)/27), both y;.% 
and ~;,, assume the value 24/2. So simplification of these two functions on the 
half-interval gives 


beips,u(t) + cxrbj,s(t) = 0429/7 + 04,297 
= 29/?(by, + cx) - (9.8.2.2) 


We have already seen that the only function at level 7 + 1 with support 
on the interval [k/23,(k + 1/2)/27) is ~j41,2%. On this interval, this function 
takes the value 2U+))/?, Hence 


‘ ) (9.8.2.3) 


fii) = 29+V/2q5, for te E oR 
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Comparing (9.8.2.2) and (9.8.2.3), we find that 
29/2 (by + cp) = 29FY/P ao, . 
As a result, 
A2k = oy nO) 
A similar analysis, which we leave to you, shows that, on the right-half 


interval [(k + 1/2)/2’,(k + 1)/27), we have the identity 


2 
Q2k+1 = mk — Cr). 


That completes the argument. oO 


We close this discussion by doing a reconstruction. 


EXAMPLE 9.8.3 In the last example we decomposed a function f3 € V3 into 
components fo € Vo, go © Wo, gi € Wi, and go © We. We found that 


3/2 3/2 
fo(t) = lt) and go(t) = > be). 

If now we add and subtract bo) = co = 3V2/2 and scale the result by 
J2 /2, then we obtain 3 and 0. These are exactly the coefficients of f1. 

If instead we take bp = 3 and b; = 0 and combine these with the coeffi- 
cients co = 1 and c; = 6 of gi, then we get 


WO (by +00) = v2 (341) =2V2. 
2 i 2) - Sosiae 

Ym +a) = ve (0+6) = 3v2. 
Ym — a1) = v (0-6) = -3v3. 


These numbers are the coefficients of fg € Vo. If we combine these with the 
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coefficients of gg © W2, then we have 


2 
(bo + €o) 


Voy + V3) =3. 


2 
2 
2 =m) V2 ay3 - V3) =1. 
2 +1) VAG (—3/2)) = -2. 
Ne bs 2500) V3 — (-3V/2)) =4. 
WO on +2) V? (av3+2Vv3) =5. 
SF ite) v2 gy3 23) =1, 
WO oy + ca) v2 _3V3-+ v3) =, 
SF ie) V2 (-3v3- v5) =-4, 
These last, of course, are the coefficients of our original fz € V3. a 


Exercises 


1. 


Calculate the sum fo(t) + go(t) + g1(t) + go(t) in Example 9.8.1 and verify 
that it equals f3. [Hint: Consider this sum on each of the subintervals 
(j,9 + 1/8) for 7 = 0,1,2,...,7. On each of these intervals, compute the 
sum and check that it equals a;.] 


Refer again to Example 9.7.1. Decompose each of the following functions 
in V2, into elements fo, go, g1 from the spaces Vo, Wo, Wi respectively. 


(a) f = 2¢3,0 — 5y2,1 + 22,5 


(b) I= ey, p2,5 


(c) h= Vid + Dyas 


Complete the proof of Proposition 9.8.2 by verifying that 


Q2k+1 = > (be — Ck). 


[Hint: Equate the terms given by beyj,n + CePj,n and fj+1.] 


For each of the functions in Exercise 2, use Proposition 9.8.2 and the 
ideas in Example 9.8.3 to reproduce f2 from the functions fo, go, and gi. 


9.9. SOME APPLICATIONS 381 


5. Find fo and go for each of the following functions from Vj: 

(a) fi = 5¢1,0 — 2y2,2 + 5y2,4 

(b) fi = 10¢1,0 — 10y1,1 — 5y1,3 + 41,5 

(c) fi = 4¥1,0 + 3¢1,1 + 8¢1,2 — 291.5 

(d) fi =a Fes 
6. Prove that the set {y(t — j)}j;ez is a linearly independent set in Wo. 
7. Show that 

(a) fo u(t)dt =0 

(b) 


(We — B)(e=H)) =f ' eS 


(c) (p(t —k), b(t —39)) = Sp v(t — k)b(t — 5) dt =0 
8. Show that the function g; in Proposition 9.7.4 is actually the projection 
of fj+1 into Wj. 


9.9 Some Applications 


We have developed considerable wavelet machinery in the preceding pages, 
and now it is time to see how these tools can be used. 
Recall that 


t 
h = "[ho, hi] = Ba | 
and i 
g v2 v2 
&=‘[90,9] = 2-3] 


EXAMPLE 9.9.1 Consider the function f3 from Example 9.8.1. We saw that 
ay = 0 for k <0 or k > 8. Furthermore, 


ao = 3, a=1, ag=-2, a3 =A 
aa=5, a,=1, ag=4, a7 = —4. 


To project f3 into V2, we use Proposition 9.7.3 to write 


by = ha? = hasta] | %° - Pe) [tJ 


bh =heal = [ho,h]-| % | = be A) [2 = 


a3 a> 4 
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bz = h-a? = [ho, hi] - | i | 7 bs 2 


by = hea? = fhovta)-| 2° | = le I -| ; J=0. 


a7 


We can naturally formulate these computations as the matrix product 
Hya=b or 


ao 
ay 
ho hy 0 0 0 0 0 0 ay bo 
0 0 ho hy 0 0 0 0 a3 _ by 
Dele SO hake OO 0 as | | be 
0 0 0 0 0 0 ho ma as bs 
a6 
a7 
or 
3 
vi VB ; 
w2wo 0 0 0 0 0 ] =) [ 22] 
Qa) AE A 0) “20 4 |_| 2 
O° <0 “20s War Ba aon 5 5 ba 
§ 0:0 0 0 90 32! ; 0 


(9.9.1.1) 
In the same fashion we can compute the detail coefficients (formerly known 
as residual or error terms) using the matrix equation G4a = c or 


ao 

ay 
go 9 9 0 0 0 0 O a2 Co 
0 0 Go 91 0 0 0 0 a3 = Cl 
0 0 0 0 Go 91 0 0 a4 = ca 
0 0 0 0 0 0 Go 91 as C3 

a6 
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or 


3 
ee A | a ee: Se | Se 
ge ae -2 V2 
0 0 @-4 0 0 0 Oo 4 |_| -3v2 
QO 0: <0) 0) 2 a. ge’ 5 2/2 
O!. 80S. OS 0." 362 40" 2 : 4V2 
4 
| 
(9.9.1.2) 
We can concatenate the matrix equations (9.9.1.1) and (9.9.1.2) to write 
b 
qaa= |], 
Here 
Fg 
a = [34] 
hip Aig 0-0) O10 OD 0 
0 0 hb m 0 0 0 0 
OF 0 0. Oe Fig he 0% 620 
a | OE Oe HO 20210. 208 Fags ah 
~ |g gg 0 0 0 0 0 0 
0 0 gw gm 0 0 0 0 
0 0 0 0 go g 0 0 
0000 0 0 w wu 


— | 


o°o ods 
— 


SJo oO on 
RorecS 


| 
bo 


ro) OK © o onlso 
| 
Jd 2 colon 
ie) 

Reo ooh 

NO 


rr rs | rr | 
I 


| 
2 ongeclo onlgo 
oly co clonyo o 


oo oN 

oo°o 

ro) 

IS o =) onyo oO 


bo 


Now let us take a closer look at the matrix Qs. We see that the inner 
product of the first row with itself is 1. In fact the inner product of any row 
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with itself is 1. And the rows are orthogonal to each other. We conclude that 


Qs is an orthogonal matrix, so that Q7' = 'Qs. 
If we form the vector 
b 
ho = ) 


where b and ¢ are defined as in the last example, then we see that 


‘asy = Qs [2 
fe 9 9 Of 0 oO 0 
Zo 0 o0o|-2@ o 06 0 
OQ: 22g OS OG 
2 oO a SO iO A eg 
re (Re eae Ce oa | 
0 0 @o]/0 0 -2 0 
0. 26. yr 2) ot 
GO Op 2) sO". ee 
bo 
by 
be 
bs 
Co 
C1 
C2 
C3 
Bin + Be 
eee 
2h. 1 V2 
de 
Bin + Zen 
By — Be, 
Big + Bey 
By — Be 


9.9. SOME APPLICATIONS 385 


V2 (94/2) + 2 (4/2) 

2 (2/2) — (V2) 
¥2(/2) + 2 (-32) 
_— | Bv2)- BC 3v2) 
v2 (3/2) + (2/2) 
¥2(3,/2) — 2(2v2) 

(0) + 2(4v2) 

(0) — 2(4v2) 


— 
rw 
— | 


We see that the matrix Qg gives us a way to take a vector of finite length 
and decompose it. Since Qg is orthogonal, we can use ‘Qs to recover the 
original vector. Now we can define the discrete Haar wavelet transformation 
matrix. 


Definition 9.9.2 Let N be an even positive integer. We define the discrete 
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Haar wavelet transformation to be 


a ee 0 0 

0 09 2 0 0 

Tis OF 0. AO AG. uae Se ee 

Qn = ae = I a oF 89.2.1) 
N/2 er ae ae 0 0 
V3 
a: “4: Ao sae 0 0 
G> 10: 30) M0: ge 2212 


The N/2 x N block, which we have called Hyg, is called the averages block 
and the N/2 x N block, which we have called G'y/z, is called the detatls block. 


We can apply the matrix Hy/2 to a vector a to see why it is called the 
averages block. We calculate 


waw@o o Oi - 6 a0 
0 09 2 0 0 ne 
Ayja = te 
an—-2 
0 0 0 0 ww gaa 
ao 
pezo0 oc o7 | & | 
we Ors0l eo 00; 
ae ae 
00 0 0 a nee 
aora1 
2 
a27a3 
2 
— J2- 


an-—2+tan-1 


We see then that Hy 2a calculates pairwise averages of consecutive values of 


a and then weights the result by VJ/2. 
Now, at last, we shall study the application of the Haar wavelet transform 
to a problem of noise-level estimation. 


EXAMPLE 9.9.3 Let 
yo=vtre. 


Here v is the true signal and e is a noise vector. In practice we do not know 
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Vv or e; instead we are given y and we want to estimate v. A key step in this 
process is estimate the noise level o that occurs from e. In this example, e is 
composed of independent samples from a normal distribution? with mean 0 
and variance a”. Such noise is called Gaussian white noise. 

Now assume that y € R%. We apply the discrete Haar wavelet transform 
to obtain 


| = Qny — Qn(v +e) = Qnv+Qne. 


Here s = Hy zy is the approximation part of the transform and d = Gy zy is 
the detail part of the transform. Because Q, is an orthogonal matrix, it can 
be seen that the elements of the vector Qye are normally distributed with 
mean 0 and variance o?. As Donoho and Johnstone point out in [DOJ], the 
main portion of the transformed noise @ye ends up in the detail portion d. 
So the vector d is an excellent candidate for estimating o. 

Hampel [HAM] showed that the median absolute deviation (MAD) of a 
sample can be used to estimate a. Let Q@meq denote the median of samples 
a= ‘lai, a2,...,an]. Then we define the MAD to be 


MAD(a) = median(|a1 — Qmea|; |@2 — Gmeal,---;|@N — @meal) - (9.9.3.1) 


Hampel showed that 
MAD(a) => 0.6745- a 


as the sample size tends to +00. Combining this result with the fact that most 
of the noise in Qyy resides in d gives us the following estimate o fo a: 


__ MAD(d) 
7 = 0.6745 


We now illustrate this wavelet-based method for estimating the noise level 
of a signal by considering the signal formed by evaluating the heaviside func- 
tion and adding some white noise to it.? We define 


h(t) = 4sin(4zt) — sgn(t — 0.3) — sgn(0.72 —t) for t€ [0,1]. 
Here we recall that 


1 if t>0 
sgn(t) = 4 0 if t=0 
—1 if t<0O 


R2048 


We form the vector v € using the formula 


k 
=h| —— for k =0,1,2,...,2047. 
Uk (=) or 0,1,2,...,2047 


?If this terminology from statistics is unfamiliar to you then you should consult a basic 
statistics textbook like [DES]. 

3Recall that the heaviside function is that function which is equal to 0 on the interval 
(—oo, 0] and equal to 1 on the interval (0, +00). 
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Next we create a noise vector e whose components e;, for k = 0,1,2,..., 2047, 
are independent samples from a normal distribution with mean 0 and variance 
o?. Just to be specific, we shall take o = 0.5. 

The first thing we do is to calculate the discrete Haar wavelet transform 
of y. We use (9.9.2.1) for this purpose. The 1024-vector s = Hio2ay (that is, 
the averages) and the detail vector d = Gio2ay result. 

Using (9.9.3.1) we calculate the median absolute deviation of d and find 
that MAD(d) = 0.356043. By Hampel’s result, cited above, the noise level a 
can be estimated by dividing this number by 0.6745. Thus 

~ 0.356043 
TRO = oe = 0.537862. 
The absolute error in this approximation is jo — o| = 0.027862 and the per- 
centage error is about 05.57%. | 


2 


Exercises 


1. Compute the discrete Haar wavelet transformations for each of these 
vectors: 
(a) 26 liad) 
(b) v= "(1,2,3,4,5, 6, 7,8) 
(c) w=‘(1,4,9, 16, 25, 36, 49, 64) 

2. Compute three iterated discrete Haar wavelet transforms for each of the 
vectors in Exercise 1. 


3. Let v € RX, where N is an even, positive integer. Show that 


U1 v2 
U3 V4 

XS : 
UN-1 UN 


These ideas lead to an efficient algorithm for calculating Qnv. 


4. Suppose that y = Qwv, where N is a positive, even integer. Then it is 
possible to recover v using the inverse transform v = ‘Quy. Prove this 
assertion. 
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Can you reformulate this matrix product in terms of the vectors h, g and 


Y1 YN/2+1 
Yy2 YN/2+2 
Y= ; ? 
YN/2 YN 


5. Define 


Qn, = diag[Q y 2: ; Iy/23 , Iyyjoi-1 sees t/a, In 2] . 


Show that the jth iteration of the discrete Haar wavelet transform is 
given by 
y’ = Qn,j-2Qn,j-2- +++ Qn 1Qnov- 


6. Refer to Exercises 5 for notation. We define 


Qn = Qn 2Qn1Qn,0, 


where N = 2?, p> 3 is a positive integer. Let v € R®°. We wish to apply 
three iterations of the discrete Haar wavelet transform to v. We may 
write 


y® = Qsv = Qa.2Q8,1Qaov. 
(a) Show that 


a Z VO Vo Ve ve WB 

4 4 4 4 4 4 4 4 

Wave _va _VB VE VR VR VB 

4 4 4 4 4 4 4 

-} -- $ $ 0 0 0 0 

_ 0 0 0 9 -$ -E g 4 
y = sVv= “Vv 

~-2 2 9 0 O fs 20F BD 

0 @ =2 2 0. 0 oO 0 

0 0 0 G. =a 2 oO 

0 0 0 0 OO: =a 


(b) Redo part (a) where now we take v € R’° and we calculate four 
iterations of the discrete Haar wavelet transform. What is Qie in 
this case? 


(c) Describe the general form of Qn, where N = 2?. 


7. For the vectors g and h defined as usual, show the following. 


(a) The Fourier series constructed from h is given by 


H(w) = a + va = V2e™ cos(w/2). 
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(b) Using part (a), show that 
|H(w)| = V2cos(w/2) for —m<w<r. 


(c) The Fourier series constructed from g is given by 


G(w) = _ 2 = —V2ie™ sin(w/2). 


(d) We have that 
|G(w)| = V2|sin(w/2)| for —t<w<r. 
(e) We may use parts (b) and (d) to show that 
HW)? + (Hw +7)? =|G@)? +|G@+n))? =2 


and 


H(w)-G(w) +H(w+7)-Giwt+r)=0. 


Dr 


9.10 Cumulative Energy and Entropy 


We now introduce two ideas from physics that we shall use to measure the 
effectiveness of discrete wavelet transformations in applications. The first of 
these, cumulative energy, is a vector-valued function that returns information 
about how energy is stored in the input vector. The second idea, entropy, is 
used to measure the performance of image-compression algorithms. The source 
of this example is [RUV]. 

The concept of entropy is particularly interesting because it uses the log- 
arithm function in a nice fashion. 


Definition 9.10.1 (Cumulative Energy) Let v ¢ RY with v 4 0. Assume 
that y is the vector formed by taking the absolute value of each component 
of v and ordering the resulting nonnegative numbers from largest to smallest. 
Then we define the cumulative energy vector C(v) to be a vector in RY whose 
components are given by 


for j=1,2,...,N. 


Certainly ||v|| = ||y||—just by definition. We also know that 
lvl? =i +uat--- +N. 


Hence we may write 


ee ee 
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ae Yi + 9B 
yt tua to tun | 


of tygt + YN a 
CVn = 
wre te TUN 


? 


_ Mita t +n 


pap ee ee 
yt +92 +--+ Yin 


C(v)Nn 


Thus we see that C(v),; is simply the percentage of the jth largest component 
(in absolute value) of v in ||v||. Finally observe that 0 < C(v); <1 for each 


Jj. 


EXAMPLE 9.10.2 Let us find the cumulative energy of each of the following 
vectors. 


(a) u= (1,2,3,4,5,6, 7,8) 

(b) v = (1,1,1,1,1,1,1,1) 

(c) w= (v2, V2, v2, V2,0.0,0,0) 
For part (a), we calculate that 


|| ul]? = 1? + 2? + -8? = 204. 


The components of C(u) are then 


j-1 2 
8-£ 
C(u); = ) ( aa for j=1,2,...,8 
£=0 


Hence 


204’ 204’ 204’ 204’ 204’ 204’ 204’ 
For part (b), observe that ||v|/? = 8 and 


64 113 149 174 190 199 203 
(u) = 


Therefore 


For part (c), notice that 


lw]? =2+2+2+24+0+0+04+0=8 
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FIGURE 9.14 
Cumulative energy for v and for w. 


The components of Cw) are then 


f. : : 
(V2)? 27 we 
C(w);= >> 25. eralized 


8 8 
e=1 
and 
co ae 8 — 
C = —4)-0=-=1 f = 5,6,7,8. 
(w); De 8 9 ) 8 1 J ied ee 
As a result, 


123 
C(w) => (Fp pphdd it) - 


Examine Figure 9.14. It shows the cumulative energy for v (denoted by 
black discs) and for w (denoted by black squares). We see that the energy is 
uniformly distributed among the elements of v. But, for w, most of the energy 
is stored in the four largest components. | 


So the cumulative energy vector can be viewed as telling us which com- 
ponents might be important contributors to a signal. In applications such as 
image compression, those components not deemed important (i.e., with small 
cumulative energy) can be assigned a zero value. This simplifies calculations 
with very small loss of information. 

Now, as we have seen, cumulative energy gives us a vector that tells us how 
the energy of a vector is distributed; by contrast, entropy tells us the average 
amount of information contained in each unit of measure. For instance, in the 
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case of digital images made up of pixels, the unit of measure is typically a bit. 
In conclusion, the entropy of a digital image (stored as a vector) tells us, on 
average, how many bits are needed to encode each pixel.* 


Definition 9.10.3 (Entropy) Let v = (v1, v2,...,vn) be an N-vector. Sup- 
pose that the v; assume & distinct values, 1 < k < N. Denote these distinct 
values by a1, d2,...,a% and let p(a;) be the relative frequency of a; in v. That 
is to say 0 < p(a;) < 1 is the number of times a, occurs in v divided by N. 
Then the entropy of v is defined to be the quantity 


k 


Ent(v) =} | p(ae) - loga(1/p(ae)) - 


f=1 


We think of v now as the first row of a digital image. Then the entries of 
v are nonnegative integers ranging from 0 (black) to 255 (white). The term 
logs(1/p(ae)) is an exponent that is measuring the power of 2 that is needed 
to represent 1/p(a¢). This exponent can also be viewed as a bit length. We 
multiply this exponent by the relative frequency of ag in v and sum over all 
distinct values ag. As a result, entropy gives an average of the number of bits 
that are needed to encode the elements of v. 

Claude Shannon (1918-2001) was one of the real pioneers of information 
theory. In 1948 he showed that the best compression rate (in bits per pixel, 
for instance) that we can hope for when performing lossless compression on 
v, as the length of v tends to infinity, is Ent(v). 


In effect, wavelet analysis has caused harmonic analysis to re-invent itself. 
Wavelets and their generalizations are powerful new tools that allow localiza- 
tion in both the space and phase variables. They are useful in producing 
unconditional bases for classical Banach spaces. They also provide flexible 
methods for analyzing integral operators. The subject of wavelets promises to 
be a fruitful area of investigation for many years to come. 


a 


Exercises 
1. Let ce R, c #0. Let N be an even positive integer. Specify the vector 
v € R® componentwise by vj =e, 7 =1,2,...,N. Then 


(a) Compute one iteration y of the discrete Haar wavelet transform of 
v. 
(b) Find Ent(v) and Ent(y). 


4Remember that each pixel will have a color assigned to it and an intensity assigned to 
it. This is information carried by a certain number of bits. 
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(c) Compute the cumulative energy vectors C(v) and C(y) and plot the 
components of each vector on the same set of axes. 

2. Redo Exercise 1 for the vector v € RY, where v; = mj+b. Herem,b ER 
with m £ 0. 

3. Let v € R™ andc € R with c ¥ 0. Define w = cv. Show that C(v) = 
C(w) and Ent(v) = Ent(w). 

4. Prove that Ent(v) > 0 with equality if and only if v is a constant vector 
with v; =c for j =1,2,...,N andceR. 

5. Let v € R'®. Here we shall investigate the effect of quantizing the aver- 
ages portion of the iterated Haar wavelet transform. 


(a) We can write the jth iterate of the Haar wavelet transform as 


yr 


lla 
1 y” 
Y= | Sea 
S| 
We replace the averages portion y!* with the zero vector 0 € R® to 
obtain 
=| 
a eee 
yo 


Next calculate the inverse transform of y' to obtain ¥. How many 
components of Vv are the same as the corresponding components of 
v? Can you give an explicit description of those components that are 
different? 


(b) Redo part (a), but now perform two iterations of the Haar wavelet 
transform and replace y?* with 0 € R*. 


(c) Redo part (a), but now perform three iterations of the Haar wavelet 
transform and replace y®** with 0 € R?. 


(d) Redo part (a), but now perform four iterations of the Haar wavelet 
transform and replace y* with 0. 


a 


Problems for Review and Discovery 
A. Drill Exercises 
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1. 


10. 


11. 


Calculate the Fourier transform of the function 


N if 0<2<1/N 
0 if a<O0orl/N<a. 


Apply Fourier inversion to obtain functions that converge back to f. 
Notice that each of the approximants has a long tail. 


Let y be a C° function which is nonnegative and such that f p(x) dx = 1. 
Consider the function y.(«) = ep(ex). Calculate the Fourier transform 
of ye. How does it behave as € > Ot? 


What is a typical element of V5? What is a typical element of V_4? 
What is a typical element of W7? What is a typical element of W_4? 


Calculate the Fourier series expansion of f(x) = xj0,1](@) - cos 2x. Also 
calculate the Haar basis expansion of f. Which is a more accurate ap- 
proximation? Why? 

Refer to Exercise 1. Calculate the Fourier series of f. How good an ap- 
proximation to f does this series give? 


Give an explicit example of a nonzero function that, on the interval [0, 1], 
2:13 


integrates to 0 against 1, x, x”, «°, «+. [Hint: Think of a polynomial of 
degree at least 5. Is there a polynomial of degree 4 that will do the job?] 
Let g(t) = t®. Calculate P2[g](t) and P_1[g](t). 


Let 
fi(t) = 291,0(t) — 3¢1,1(t) + 5¢1,2(t) — 2¢91,3(t) + 4y1,4(0) + Y1,5()- 


Calculate that the projection fo of f1 into Vo 
Plot each of these functions. Give a verbal description of the compact 
support of each function. 


(a) ~1,-3(t) 

(b) ~-2,4(t) 

(c) ws,6(t) 

Plot each of these functions. 

(a) f(t) = 3p(t/8 — 2) — 2¥(t/8 + 1) + W(t/8 + 1) 
(b) g(t) = Wy 5 V(8t + 5) 

(c) A(t) = O5__, cos(mj)W(t/4 + J) 

(d) k(t) = eo (e7*, Y(t — 5)) - P(t — 9) 


B. Challenge Problems 


1. 


What can you say about the discrete Haar wavelet transform of the func- 


tion f(x) = a? ? 


2. Let f € Co°(R). Then it is true that 


If) < Cw (A+ [EI 


for every positive integer N. Explain why. [Hint: Think in terms of in- 
tegration by parts.] 
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Refer to Exercise 2. Is something similar true for the discrete Haar wavelet 
transform? 


Let f be a continuous function on [0,27]. Let 0 < a < 1. Suppose that, 
for each positive integer N, there is a trigonometric polynomial py such 
that 


sup |f(«)—pn(z)|<C-N™. 
x€ [0,27] 


Prove that f is Lipschitz of order a. 


Let 0 < a < 1. Discuss the behavior of the Hilbert transform on the 
Lipschitz space of order a. 


The Hilbert transform is not bounded on the space of integrable functions 
on the real line. Explain why. 


The Hilbert transform is not bounded on the space of bounded functions 
on the real line. Explain why. 


Find fo and go for each of the following functions from V1: 


(a) fi = 3¢1,0 — 5y2,2 + 22,4 

(b) fi = 891,0 — 591,1 — 21,3 + 31,5 

(c) fi = 2¢1,0 + 2¢1,1 + 2¢91,2 — 3¢1,5 

(d) fi= UR eens 

Compute the discrete Haar wavelet transformations for each of these 
vectors: 

(a) u=*(1,2,1,2, 1,2, 1,2) 

(b) v = ‘°(2,1,4,3,6,5, 8, 7) 

(c) w= “(1,8, 27, 64, 125, 216, 343, 512) 

Compute three iterated discrete Haar wavelet transforms for each of the 
vectors in Exercise 9. 


C. Problems for Discussion and Exploration 


1. 


Produce a tertiary theory of wavelets modeled on the binary Haar 
wavelets that we discussed in the text. 


Discuss pointwise convergence for the discrete Haar wavelet transform. 
Give a sufficient condition on a function f for its discrete Haar wavelet 
transform to converge uniformly. 

Show that every integrable function is the limit, in the topology of dis- 
tributions, of functions in C?°. 


10 


Partial Differential Equations and 
Boundary Value Problems 


e Boundary value problems 
e Ideas from physics 

e The wave equation 

e The heat equation 

e The Laplacian 

e The Dirichlet problem 

e The Poisson integral 


e Sturm—Liouville problems 


a 


10.1 Introduction and Historical Remarks 


In the middle of the eighteenth century much attention was given to the 
problem of determining the mathematical laws governing the motion of a 
vibrating string with fixed endpoints at 0 and a (Figure 10.1). An elementary 
analysis of tension shows that, if y(x,¢) denotes the ordinate of the string at 
time t above the point x, then y(,t) satisfies the wave equation 

Oy 20°y 


———: =. 
Ot? Ox? 

(see Sections 2.5, 2.8). Here a is a parameter that depends on the tension of 

the string (in fact a is also the velocity of the traveling solutions of the wave 

equation). A change of scale will allow us to assume that a = 1. (A bit later 


we shall actually provide a formal derivation of the wave equation. See also 
[KRA3] for a more thorough consideration of these matters.) 
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= 0 ae 


FIGURE 10.1 
The wave equation. 


In 1747 d’Alembert showed that solutions of this equation have the form 


sane s(s+2) +0(t-.), (10.1.1) 


where f and g are “any” functions of one variable. (The following technicality 
must be noted: the functions f and g are initially specified on the interval 
(0, 7]. We extend f and g to [—7, 0] and to [z, 27] by odd reflection. Continue 
f and g to the rest of the real line so that they are 27-periodic.) 

In fact the wave equation, when placed in a “well-posed” setting, comes 
equipped with two initial conditions: 


(i) y(x,0) = g(a) 
(ii) Ay(x,0) = (a). 


These conditions mean (i) that the wave has an initial configuration that is 
the graph of the function y and (ii) that the string is released with initial 
velocity w. 

If (10.1.1) is to be a solution of this initial value problem then f and g 
must satisfy 


(f(@) + g(—2)) = 9(@) (10.1.2) 


OH es 


and 


5 (F(a) + 9!(-2)) =v). (10.1.3) 


Integration of (10.1.3) gives a formula for f(x) — g(—a). That and (10.1.2) 
give a system that may be solved for f and g with elementary algebra. 

The converse statement holds as well: for any functions f and g, a func- 
tion y of the form (10.1.1) satisfies the wave equation (Exercise). The work 
of d’Alembert brought to the fore a controversy which had been implicit in 
the work of Daniel Bernoulli, Leonhard Euler, and others: what is a “func- 
tion”? (We recommend the article [LUZ] for an authoritative discussion of the 
controversies that grew out of classical studies of the wave equation. See also 
[LAN].) 

It is clear, for instance, in Euler’s writings that he did not perceive a 
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function to be an arbitrary “rule” that assigns points of the range to points 
of the domain; in particular, Euler did not think that a function could be 
specified in a fairly arbitrary fashion at different points of the domain. Once a 
function was specified on some small interval, Euler thought that it could only 
be extended in one way to a larger interval. Therefore, on physical grounds, 
Euler objected to d’Alembert’s work. He claimed that the initial position of 
the vibrating string could be specified by several different functions pieced 
together continuously, so that a single f could not generate the motion of the 
string. 

Daniel Bernoulli solved the wave equation by a different method (separa- 
tion of variables, which we treat below) and was able to show that there are 
infinitely many solutions of the wave equation having the form 


yp; (x,t) =sinjxcosjt, j > 1 an integer. 


Proceeding formally, he posited that all solutions of the wave equation satis- 
fying y(0,t) = y(a,t) = 0 and O:y(x, 0) = 0 will have the form 


Co 
y= 5 aj; sin jx cos jt. 
j=l 


Setting t = 0 indicates that the initial form of the string is f(z) = 
yt a;sinjz. In d’Alembert’s language, the initial form of the string is 
4(f(x) — f(—«)), for we know that 


0= y(0,t) = f(t) + g(t) 


(because the endpoints of the string are held stationary), hence g(t) = —f(t). 
If we suppose that d’Alembert’s function is odd (as is sin jx, each j), then the 
initial position is given by f(x). Thus the problem of reconciling Bernoulli’s 
solution to d’Alembert’s reduces to the question of whether an “arbitrary” 
function f on [0,7] may be written in the form )°>* , aj sin jz. 

Since most mathematicians contemporary with Bernoulli believed that 
properties such as continuity, differentiability, and periodicity were preserved 
under (even infinite) addition, the consensus was that arbitrary f could not be 
represented as a (even infinite) trigonometric sum. The controversy extended 
over some years and was fueled by further discoveries (such as Lagrange’s 
technique for interpolation by trigonometric polynomials) and more specula- 
tions. 

In the 1820s, the problem of representation of an “arbitrary” function by 
trigonometric series was given a satisfactory answer as a result of two events. 
First, there is the sequence of papers by Joseph Fourier culminating with 
the tract [FOU]. Fourier gave a formal method of expanding an “arbitrary” 
function f into a trigonometric series. He computed some partial sums for 
some sample fs and verified that they gave very good approximations to f. 
Second, Dirichlet proved the first theorem giving sufficient (and very general) 
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conditions for the Fourier series of a function f to converge pointwise to f. 
Dirichlet was one of the first, in 1828, to formalize the notions of partial sum 
and convergence of a series; his ideas certainly had antecedents in the work 
of Gauss and Cauchy. 

For all practical purposes, these events mark the beginning of the math- 
ematical theory of Fourier series (see [LAN]). Refer to our Chapter 6 as you 
read along. 


Math Nugget 


The Bernoulli family was one of the foremost in all of the 
history of science. In three generations this remarkable Swiss 
family produced eight mathematicians, three of them out- 
standing. These in turn produced a swarm of descendants 
who distinguished themselves in many fields. 

James Bernoulli (1654-1705) studied theology at the in- 
sistence of his father, but soon threw it over in favor of 
his love for science. He quickly learned the new “calculus” 
of Newton and Leibniz, became Professor of Mathematics 
at the University of Basel, and held that position until his 
death. James Bernoulli studied infinite series, special curves, 
and many other topics. He invented polar coordinates and 
introduced the Bernoulli numbers 


10.2. 


EIGENVALUES AND THE VIBRATING STRING 


that appear in so many contexts in differential equations 
and special functions. In his book Ars Conjectandi he for- 
mulated what is now known as the law of large numbers (or 
Bernoulli’s theorem). This is both an important philosoph- 
ical and an important mathematical fact; it is still a source 
of study. 

James’s younger brother John (Johann) Bernoulli (1667— 
1748) also made a false start by first studying medicine and 
earning a doctor’s degree at Basel in 1694 with a thesis 
on muscle contraction. He also became fascinated by calcu- 
lus, mastered it quickly, and applied it to many problems 
in geometry, differential equations, and mechanics. In 1695 
he was appointed Professor of Mathematics at Groningen in 
Holland. On James Bernoulli’s death, John succeeded him in 
the chair at Basel. The Bernoulli brothers sometimes worked 
on the same problems; this was unfortunate in view of the 
family trait of touchiness and jealousy. On occasion their 
inherent friction flared up into nasty public feuds, more re- 
sembling barroom brawls than scientific debates. 

In particular, both James and John were solvers of the 
celebrated brachistochrone problem (along with Newton and 
Leibniz). They quarreled for years over the relative merits 
of their different solutions (John’s was the more elegant, 
James’s the more general). John Bernoulli was particularly 
cantankerous in his personal affairs. He once threw his own 
son (Daniel) out of the house for winning a prize from the 
French Academy that he himself coveted. 

Daniel Bernoulli (1700-1782) studied medicine like his 
father, and took a degree with a thesis on the action of 
the lungs. He soon yielded to his inborn talent and became 
Professor of Mathematics at St. Petersburg. In 1733 he re- 
turned to Basel and was, successively, professor of botany, 
anatomy, and physics. He won ten prizes from the French 
Academy (including the one that infuriated his father), and 
over the years published many works on physics, probability, 
calculus, and differential equations. His famous book Hy- 
drodynamica discusses fluid mechanics and gives the earliest 
treatment of the kinetic theory of gases. Daniel Bernoulli 
was arguably the first mathematical physicist. 
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a 


10.2 Eigenvalues, Eigenfunctions, and the Vibrating 
String 


10.2.1 Boundary Value Problems 


We wish to motivate the physics of the vibrating string. We begin this discus- 
sion by seeking a nontrivial solution y of the differential equation 


y+ Ay =0 (10.2.1) 
subject to the conditions 
y(0) =0 and y(7) =0 (10.2.2) 


on the interval [0,7]. Here . is a fixed constant. 

Notice that this is a different situation from the one we have studied in 
earlier parts of the book. In Chapter 2 on second-order linear equations, we 
usually had initial conditions y(xo) = yo and y'(a%9) = y1. Now we have what 
are called boundary conditions: we specify a condition (in this instance the 
value) for the function at two different points. For instance, in the discussion 
of the vibrating string in the last section, we wanted our string to be pinned 
down at the two endpoints. These are typical boundary conditions coming 
from a physical problem. 

The situation with boundary conditions is quite different from that for 
initial conditions. The latter is a sophisticated variation of the fundamental 
theorem of calculus. The former is rather more subtle. So let us begin to 
analyze. 

First, if A < 0 then any solution of (10.2.1) has at most one zero. So 
it certainly cannot satisfy the boundary conditions (10.2.2). Alternatively, 
we could just solve the equation explicitly when » < 0 and see that the 
independent solutions are a pair of exponentials, no linear combination of 
which can satisfy (10.2.2). 

If \ = 0 then the general solution of (10.2.1) is the linear function y = 
Ax + B. Such a function cannot vanish at two points unless it is identically 
Zero. 

So the only interesting case is A > 0. In this situation, the general solution 
of (10.2.1) is 

y= Asin Vix + Boos Vix. 


Since y(0) = 0, this in fact reduces to 
y = Asin Vx. 


In order for y(7) = 0, we must have VAm = nm for some positive integer n, 
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x= 0 X= T 


FIGURE 10.2 
The string in relaxed position. 


thus \ = n?. These values of are termed the eigenvalues of the problem 
(refer to Section 4.1), and the corresponding solutions 


sing, sin2z, sin3dz... 


are called the eigenfunctions of the problem (10.2.1), (10.2.2). 
We note these immediate properties of the eigenvalues and eigenfunctions 
for our problem: 


(i) If ¢ is an eigenfunction for eigenvalue , then so is c- y for any 
constant c. 

(ii) The eigenvalues 1,4,9,... form an increasing sequence that ap- 
proaches +00. 


(iii) The nth eigenfunction sin nz vanishes at the endpoints 0,7 (as we 
originally mandated) and has exactly n — 1 zeros in the interval 
(0,7). 


10.2.2. Derivation of the Wave Equation 


Now let us re-examine the vibrating string from the last section and see how 
eigenfunctions and eigenvalues arise naturally in a physical problem. We con- 
sider a flexible string with negligible weight that is fixed at its ends at the 
points (0,0) and (7,0). The string is deformed into an initial position y = f (x) 
in the x-y plane and then released. See Figure 10.1. 

Our analysis will ignore damping effects, such as air resistance. We assume 
that, in its relaxed position, the string is as in Figure 10.2. The string is plucked 
in the vertical direction, and is thus set in motion in a vertical plane. We will 
be supposing that the oscillation has small amplitude. 

We focus attention on an “element” Az of the string (Figure 10.3) that 
lies between x and x + Ax. We adopt the usual physical conceit of assuming 
that the displacement (motion) of this string element is small, so that there 
is only a slight error in supposing that the motion of each point of the string 
element is strictly vertical. We let the tension of the string, at the point x at 
time t, be denoted by T(x, t). Note that T acts only in the tangential direction 
(i.e., along the string). We denote the mass density (mass per unit length) of 
the string by p. 

Since there is no horizontal component of acceleration, we see that 


T(a + Aa, t) -cos(6 + Ad) — T(x, t) - cos(@) = 0. (10.2.3) 
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FIGURE 10.3 


An element of the plucked string. 


(Refer to Figure 10.4: The expression T'(x)-cos(x) denotes H(x), the horizontal 
component of the tension.) Thus equation (10.2.3) says that H is independent 
of x. 

Now we look at the vertical component of force: 


T(a+ Ag, t) -sin(@ + A@) — T(a,t) -sin(@) = p- Axv-ue(%,t). (10.2.4) 


Here @ is the mass center of the string element and we are applying Newton’s 
second law—that the external force is the mass of the string element times 
the acceleration of its center of mass. We use subscripts to denote derivatives. 
We denote the vertical component of T(x) by V(x). Thus equation (10.2.4) 
can be written as 
V(a + Az, t) — V(a,t) fe) 
— Lay FF - 
Ag P° Uttle, 
Letting Ax — 0 yields 
Vi (a, t) = p- Utn(a, t). (10.2.5) 
We would like to express equation (10.2.5) entirely in terms of u, so we 
notice that 
V(a,t) = A(t) tané = A(t) -uz(a,t). 
(We have used the fact that the derivative in x is the slope of the tangent line, 
which is tan 6.) Substituting this expression for V into (10.2.5) yields 


(Huz)« = P° Ute. 
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‘ H =T cos 9 
FIGURE 10.4 
The horizontal component of the tension. 
But H is independent of x, so this last line simplifies to 
A + Ure = Pp: Utt- 


For small displacements of the string, @ is nearly zero, so H = T cos 
is nearly T. We are most interested in the case where T is constant. And of 
course p is constant. Thus we finally write our equation as 


Use = Utt- 
p 


It is traditional to denote the constant T’/p on the left by a?. We finally arrive 
at the wave equation 
A Une = Utt- 


10.2.3 Solution of the Wave Equation 


We consider the wave equation 


A You = Yet (10.2.6) 
with the boundary conditions 

y(0,t) =0 
and 

y(m,t) =0. 


Physical considerations dictate that we also impose the initial conditions 


Oy 


me = 10.2.7 
OF | 5-6 q ( ) 
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(indicating that the initial velocity of the string is 0) and 


y(x, 0) = f(x) (10.2.8) 


(indicating that the initial configuration of the string is the graph of the 
function f). 

We solve the wave equation using a classical technique known as “sepa- 
ration of variables.” For convenience, we assume that the constant a = 1. We 
guess a solution of the form y(z,t) = v(x) - w(t). Putting this guess into the 
differential equation 

Yau = Ytt 
gives 


p(x) b(t) = p(x)" (t) 
We may obviously separate variables, in the sense that we may write 
g'(z) _ wv" 
p(t) W(t) 


The left-hand side depends only on x while the right-hand side depends only 
on t. The only way this can be true is if 


gi) _,_ wv) 
ala) WO) 


for some constant ». But this gives rise to two second-order linear, ordinary 
differential equations that we can solve explicitly: 


gp’ =rA-.” (10.2.9) 
pl =r-o. (10.2.10) 


Observe that this is the same constant » in both of these equations. Now, 
as we have already discussed, we want the initial configuration of the string to 
pass through the points (0,0) and (7,0). We can achieve these conditions by 
solving (10.2.9) with y(0) = 0 and y(z) = 0. But of course this is the eigen- 
value problem that we treated at the beginning of the section. The problem 
has a nontrivial solution if and only if \ = —n? for some positive integer n, 
and the corresponding eigenfunction is 


n(x) = sinna. 
For this same \, the general solution of (10.2.10) is 
v(t) = Asinnt+ Bcosnt. 


If we impose the requirement that 2’(0) = 0, so that (10.2.7) is satisfied, then 
A = 0 and we find the solution 


w(t) = Becosnt. 
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This means that the solution we have found of our differential equation with 
boundary and initial conditions is 


Yn(x,t) = B-sinnacosnt. (10.2.11) 


And in fact any finite sum with real coefficients (or linear combination) of 
these solutions will also be a solution: 


y = a, sinxcost+ a2 sin2xcos2t+---a,sinkacoskt. 


Ignoring the rather delicate issue of convergence (which was discussed a 
bit in Section 6.2), we may claim that any infinite linear combination of the 
solutions (10.2.11) will also be a solution: 


Co 


y = >_ bj sin jx cos jt. a1) 


j=1 


Now we must examine the initial condition (10.2.8). The mandate 
y(z,0) = f(x) translates to 


» b; sin jx = y(x,0) = f(z) (10.2.13) 
or ss 
ys bjp3(«) = y(x,0) = f(z). (10.2.14) 


Thus we demand that f have a valid Fourier series expansion. We know from 
our studies in Chapter 6 that such an expansion is correct for a rather broad 
class of functions f. Thus the wave equation is solvable in considerable gen- 
erality. 

Now fix m # n. We know that our eigenfunctions ~; satisfy 


" 2 " 2 
Pm = Tl Pm and Pn = 1 Pn- 


Multiply the first equation by y, and the second by y,, and subtract. The 
result is 


Lr Yin — PmPy = (Nn? — m7) OnOm 


[Pn Ym — Pm¥n]’ = (n? _ m?)OnYm : 
We integrate both sides of this last equation from 0 to 7 and use the fact 
that y;(0) = y;(7) = 0 for every j. The result is 
TT 


0 = [GnYm — PmPnl 


= (n? —m?) a Pm(X)Pn(2) dx. 


0 
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Thus zs 
| sinmz sin nz dz = 0 forn #m (10.2.15) 
0 


or 


i Pm(£)Pn(x) dx = 0 forn Am. (10.2.16) 
0 


Of course this is a standard fact from calculus. But now we understand it as an 
orthogonality condition (see Sections 4.1, 6.5), and we see how the condition 
arises naturally from the differential equation. As we have seen in Chapter 
4, these ideas fit rather naturally into the general context of Sturm—Liouville 
problems. 

In view of the orthogonality condition (10.2.16), it is natural to integrate 
both sides of (10.2.14) against yz(x). The result is 


[ F(z)-pr(z)de = is (= be,(a)) pele) dx 


= d4; | pj (x) px (x) dx 
jap 7? 
T 
—by. 
ras 
Here we use the fact that the integral is 0 when j 4 k and is equal to 7/2 
otherwise. 


The by are the Fourier coefficients that we studied in Chapter 6. Using 
these coefficients, we have Bernoulli’s solution (10.2.12) of the wave equation. 


eS 


Exercises 


1. Find the eigenvalues \,, and the eigenfunctions y, for the equation y” + 
Ay = 0 in each of the following instances. 


(a) y(0)=0, y(n/2) =0 


(b) y(0)=0, y(2r) =0 

(c) y(0)=0, y()=0 

(d) y(0)=0, y(L)=0 for L>0 
(e) y(-L)=0, y(L)=0 forL>0 


(f) y(a)=0, y(b)=0 fora<b 
Solve the following two exercises without worrying about convergence of 
series or differentiability of functions. 
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FIGURE 10.5 
Wave of fixed shape moving to the left. 


2. If y = F(a) is an arbitrary function, then y = F(x + at) represents a 
wave of fixed shape that moves to the left along the x-axis with velocity 
a (Figure 10.5). 


Similarly, if y = G(x) is another arbitrary function, then y = G(a—at) 
is a wave moving to the right, and the most general one-dimensional wave 
with velocity a is 


y(a,t) = F(x + at) + G(x — at). (x) 


(a) Show that (*) satisfies the wave equation. 

(b) It is easy to see that the constant a in the wave equation has the 
dimension of velocity. Also, it is intuitively clear that if a stretched 
string is disturbed, then the waves will move in both directions away 
from the source of the disturbance. These considerations suggest in- 
troducing the new variables a = x + at, 6 = x — at. Show that with 
these independent variables, the wave equation becomes 


Oy 


daop °° 


From this derive (*) by integration. Formula (*) is called d’Alembert’s 
solution of the wave equation. It was also obtained, slightly later and 
independently, by Euler. 
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3. Consider an infinite string stretched taut on the x-axis from —oo to +00. 
Let the string be drawn aside into a curve y = f(x) and released, and 
assume that its subsequent motion is described by the wave equation. 


(a) Use (x) in Exercise 2 to show that the string’s displacement is given 
by d’Alembert’s formula 


ule.) = SUF (e+ at) + fle —at)]. (8) 


Hint: Remember the initial conditions. 

(b) Assume further that the string remains motionless at the points « = 
0 and x = z (such points are called nodes), so that y(0,t) = y(7,t) = 
0, and use (**) to show that f is an odd function that is periodic 
with period 27 (that is, f(—x) = f(x) and f(# +27) = f(z)). 

(c) Show that since f is odd and periodic with period 27 then f neces- 
sarily vanishes at 0 and 7. 

(d) Show that Bernoulli’s solution of the wave equation can be written 
in the form (**x). Hint: Note that 2sin nz cos nat = sin[n(# + at)| + 
sin[n(x — at)]. 

4. Solve the vibrating string problem in the text if the initial shape y(z,0) = 
f(x) is specified by the given function. In each case, sketch the initial 
shape of the string on a set of axes. 

(a) 
sa) =f Qca/n ik OS a <r /2 
2e(n—a)/m if ar/2<aK<a 


(b) 


(c) 
x if 0 <a<7/4 
f(x) = w/4 if mw/4<a<3n/4 
m—-x if 3n/4<a<a7 


5. Solve the vibrating string problem in the text if the initial shape y(,0) = 
f(a) is that of a single arch of the sine curve f(x) = csin«. Show that 
the moving string always has the same general shape, regardless of the 
value of c. Do the same for functions of the form f(x) = csinnz. Show 
in particular that there are n — 1 points between x = 0 and x = 7 at 
which the string remains motionless; these points are called nodes, and 
these solutions are called standing waves. Draw sketches to illustrate the 
movement of the standing waves. 

6. The problem of the struck string is that of solving the wave equation with 
the boundary conditions 


y(0,t)=0, y(7,t) =0 
and the initial conditions 
oy 


Ot | 4-9 
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(These initial conditions mean that the string is initially in the equilib- 
rium position, and has an initial velocity g(x) at the point x as a result 
of being struck.) By separating variables and proceeding formally, obtain 
the solution 


y(a,t) = ‘> cj sin jx sin jat , 
j= 


where 5 7 
,=— in jx dz. 
Cj ree g(x) sin jx dx 


10.3. The Heat Equation 


Fourier’s Point of View 


In [FOU], Fourier considered variants of the following basic question. Let there 
be given an insulated, homogeneous rod of length a with initial temperature 
at each x € [0,7] given by a function f(x) (Figure 10.6). Assume that the 
endpoints are held at temperature 0, and that the temperature of each cross- 
section is constant. The problem is to describe the temperature u(x,t) of 
the point x in the rod at time t. Fourier [FOU] perceived the fundamental 
importance of this problem as follows: 


Primary causes are unknown to us; but are subject to simple and con- 
stant laws, which may be discovered by observation, the study of them 
being the object of natural philosophy. 

Heat, like gravity, penetrates every substance of the universe, its rays 
occupying all parts of space. The object of our work is to set forth the 
mathematical laws which this element obeys. The theory of heat will here- 
after form one of the most important branches of general physics .... 

I have deduced these laws from prolonged study and attentive compar- 
ison of the facts known up to this time; all these facts I have observed 
afresh in the course of several years with the most exact instruments that 
have hitherto been used. 


Let us now describe the manner in which Fourier solved his problem. First, it 
is required to write a differential equation which u satisfies. We shall derive 
such an equation using three physical principles: 


(1) The density of heat energy is proportional to the temperature u, 
hence the amount of heat energy in any interval [a, b] of the rod is 
: b 
proportional to f° u(x,t) de. 


(2) (Newton’s law of cooling) The rate at which heat flows from a 
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TU 


FIGURE 10.6 
The insulated rod. 


hot place to a cold one is proportional to the difference in temper- 
ature. The infinitesimal version of this statement is that the rate 
of heat flow across a point x (from left to right) is some negative 
constant times 0,u(z, t). 


(3) (Conservation of Energy) Heat has no sources or sinks. 


Now (3) tells us that the only way that heat can enter or leave any interval 
portion [a, b] of the rod is through the endpoints. And (2) tells us exactly how 
this happens. Using (1), we may therefore write 


d b 


a I, u(a, t) dx = n?[0,u(b, t) — O,u(a, t)]. 


We may rewrite this equation as 
b b 
/ Opu(a, t) dx = v | O2u(a, t) dx. 
a a 
Differentiating in b, we find that 


Ou = 7°02 u, (10.3.1) 


and that is the heat equation. 
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Math Nugget 


The English biologist J. B. S. Haldane (1892-1964) had this 
remark about the one-dimensional heat equation: “In scien- 
tific thought we adopt the simplest theory which will explain 
all the facts under consideration and enable us to predict 
new facts of the same kind. The catch in this criterion lies 
in the word ‘simplest.’ It is really an aesthetic canon such as 
we find implicit in our criticism of poetry or painting. The 
layman finds such a law as 
20°w = Ow 


“O02 Ot 
much less simple than ‘it oozes,’ of which it is the math- 
ematical statement. The physicist reverses this judgment, 
and his statement is certainly the more fruitful of the two, 
so far as prediction is concerned. It is, however, a statement 
about something very unfamiliar to the plain man, namely, 
the rate of change of a rate of change.” 


Suppose for simplicity that the constant of proportionality 7? equals 1. 
Fourier guessed that equation (10.3.1) has a solution of the form u(z,t) = 
a(x)G(t). Substituting this guess into the equation yields 


a(x) 3'(t) = a! (x) B(t) 


or 

Bt) _ al(x) 

B(t) ala) 
Since the left side is independent of x and the right side is independent of t, 
it follows that there is a constant K such that 


BO) _ og _ (2) 


B(t) a(x) 
b(t) = KB) 
a(x) = Ka(z). 


We conclude that 3(t) = Ce**. The nature of 3, and hence of a, thus 
depends on the sign of K. But physical considerations tell us that the tem- 
perature will dissipate as time goes on, so we conclude that K < 0. Therefore 
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a(z) = cos/~—Ka and a(x) = sin/—Kza are solutions of the differential 
equation for a. The initial conditions u(0,t) = u(a,t) = 0 (since the ends 
of the rod are held at constant temperature 0) eliminate the first of these 
solutions and force K = —j?, 7 an integer. Thus Fourier found the solutions 


u,(a,t) = eft sin jx », JEN 


of the heat equation. By linearity, any finite linear combination 
é 2 
u(x,t) = S~ bye? * sin jar (10.3.2) 
j=l 


of these solutions is also a solution. It is plausible to extend this assertion to 
infinite linear combinations. Using the initial condition u(#,0) = f(x) again 
raises the question of whether “any” function f(a) on [0,7] can be written as 
a (infinite) linear combination of the functions sin jx. 

Fourier’s solution to this last problem (of the sine functions spanning 
essentially everything) is roughly as follows. Suppose f is a function that is 
so representable: 


f(x) = D2 sin jx. (10.3.3) 


Setting « = 0 gives 
f(0) =0. 
Differentiating both sides of (10.3.3) and setting x = 0 gives 


f'(0) = S5 iby. (10.3.4) 
j=l 
Successive differentiation of (10.3.3), and evaluation at 0, gives 
POS Sori 
j=l 


for k odd (by oddness of f, the even derivatives must be 0 at 0). Here | | 
denotes the greatest integer function. Thus Fourier devised a system of in- 
finitely many equations in the infinitely many unknowns {b;}. He proceeded 
to solve this system by truncating it to an N x N system (the first N equa- 
tions restricted to the first N unknowns), solving that truncated system, and 
then letting N tend to oo. Suffice it to say that Fourier’s arguments contained 
many dubious steps (see [FOU] and [LAN]). 
The upshot of Fourier’s intricate and lengthy calculations was that 


bj = = | f(a) sin jx da. (10.3.5) 
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By modern standards, Fourier’s reasoning was specious; for he began 
by assuming that f possessed an expansion in terms of sine functions. The 
formula (10.3.5) hinges on that supposition, together with steps in which 
one compensated division by zero with a later division by oo. Nonetheless, 
Fourier’s methods give an actual procedure for endeavoring to expand any 
given f in a series of sine functions. 

Fourier’s abstract arguments constitute the first part of his book. The 
bulk, and remainder, of the book consists of separate chapters in which the 
expansions for particular functions are computed. 


EXAMPLE 10.3.1 Suppose that the thin rod in the setup of the heat equation 
is first immersed in boiling water so that its temperature is uniformly 100°C. 
Then imagine that it is removed from the water at time t = 0 with its ends 
immediately put into ice so that these ends are kept at temperature 0°C. Find 
the temperature u = u(x,t) under these circumstances. 


Solution: 
The initial temperature distribution is given by the constant function 


f(z)=100, O<au<n. 


The two boundary conditions, and the other initial condition, are as usual. 
Thus our job is simply this: to find the sine series expansion of this function 
f. We calculate that 


9) Tv 
bj = =| 100 sin jx dx 
T JO 
_ 200 cos ja |" 
T j 0 
= 200 — ~ A 
7 j ‘) 
0 if j = 20 is even 
= 4 
salad if j =2@-—1 is odd. 
Td 
Thus 
F(a) 400 si sin3x2  sindx 
x) = —|sinzr-4 t free | 
T 3 5 


Now, referring to formula (10.3.2) from our general discussion of the heat 
equation, we know that 


4 1 1 
u(x,t) = zi (< sin x + = sin 3a + sor sin5a+-- +) . a 
7 
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EXAMPLE 10.3.2 Find the steady-state temperature of the thin rod from our 
analysis of the heat equation if the fixed temperatures at the ends x = 0 and 
x= are w, and wz respectively. 


Solution: 
The phrase “steady-state” means that 0u/Ot = 0, so that the heat equation 
reduced to 0?u/0x? = 0 or d?u/dx? = 0. The general solution is then u = 
Ax + B. The values of these two constants A and B are forced by the two 
boundary conditions. 

In fact a little high school algebra tells us that 


u=wi+ —(we —wi)a. a 
T 


The steady-state version of the 3-dimensional heat equation 


> ( 07u e Oui dru Ou 
a“ | —>+554+55/=> 
Ox? Oy? Oz? Ot 
is 
Oru " Oru . Oru _ 
Ox? © Oy? © O22 
This last is called Laplace’s equation. The study of this equation and its so- 
lutions and subsolutions and their applications is a deep and rich branch of 
mathematics called potential theory. There are applications to heat, to gravi- 
tation, to electromagnetics, and to many other parts of physics. The equation 
plays a central role in the theory of partial differential equations, and is also 


an integral part of complex variable theory. 
i a 


Exercises 


1. Solve the boundary value problem 


ow Ow 
Ox? ot 
w(r,0) = f(x) 
w(0,t) = O 
w(r,t) = 0 


if the last three conditions—the boundary conditions—are changed to 


w(a,0) = f(a) 
w(0,t) = wi 
w(r,t) = we. 


[Hint: Write w(2,t) = W(a,t) + g(x), where g(x) is the function that 
we produced in Example 10.3.2.] 
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2. Suppose that the lateral surface of the thin rod that we analyzed in the 
text is not insulated, but in fact radiates heat into the surrounding air. 
If Newton’s law of cooling (that a body cools at a rate proportional to 
the difference of its temperature with the temperature of the surrounding 
air) is assumed to apply, then show that the 1-dimensional heat equation 


becomes 
po Ot ees wo) 
Ox? At 
where c is a positive constant and wo is the temperature of the surround- 
ing air. 


3. In Exercise 2, find w(z, t) if the ends of the rod are kept at 0°C, wo = 0°C, 
and the initial temperature distribution on the rod is f(z). 

4. In Example 10.3.1, suppose that the ends of the rod are insulated instead 
of being kept fixed at 0°C. What are the new boundary conditions? Find 
the temperature w(x,t) in this case by using just common sense—and 
not calculating. 

5. Solve the problem of finding w(x, t) for the rod with insulated ends at 
x = 0 and « = a (see the preceding exercise) if the initial temperature 
distribution is given by w(x,0) = f(a). 

6. The 2-dimensional heat equation is 

(Se, Fe) _ oe 
Ox? Oy? Ot” 


Use the method of separation of variables to find a steady-state solution 
of this equation in the infinite half-strip of the x-y plane bounded by the 


lines c = 0, x = 7, and y = 0 if the following boundary conditions are 
satisfied: 
w(0,y,t) = 0 w(m,y,t) =0 
w(x,0,0) = f(x) lim w(x,y,t) = 0. 
y— too 


7. Derive the 3-dimensional heat equation 


(2 , Sw, Pw _ aw 
Ox? Oy? Az?) at 


by adapting the reasoning in the text to the case of a small box with 
edges Ax, Ay, Az contained in a region R in x-y-z space where the 
temperature function w(z, y, z,t) is sought. [Hint: Consider the flow of 
heat through two opposite faces of the box, first perpendicular to the 
x-axis, then perpendicular to the y-axis, and finally perpendicular to the 
z-axis. ] 
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TT 


10.4. The Dirichlet Problem for a Disc 


We now study the two-dimensional Laplace equation, which is 


Ow dw 
Aw =~ +—5 =0. 

Ox? ~— Oy? 
It will be useful for us to write this equation in polar coordinates. To do so, 
recall that 


r=ae+y , c=rcosd , y=rsind. 


Thus 
3) Ox O  OyOA ) a) 
— = ~——4+~— =cosd— + sind— 
Or Or Ox Or Oy rae Ox Tae S 
O. = OOO OUI emp eet nos cies 0. 
00 =—— (00 Ax) OO Oy Ox Oy” 


We may solve these two equations for the unknowns 0/0x and 0/0y. The 
result is 
() QO sin@ 0 () O  cosdO 


Bg = 885. — - 20 and Ta ae 90° 


A tedious calculation now reveals that 


OP Oo? ( O- siné =) ( QO sind =) 
= —5+-—5 = [| cosd— — — | | cosé— — ——— 
y 


Ox? ~~ Oy? Or r 00 Or r 00 
ET pO CON ONT ay 0. KOSH. 
Or r 060 on Or r 006 
2 1 2 
=O 5h 0 ger (10.4.1) 


Or?" r Or * 7? 0G? 


Let us fall back once again on the separation of variables method. We shall 
seek a solution w = w(r,?) = u(r) - u(@) of the Laplace equation. Using the 
polar form (10.4.1) of the Laplacian, we find that this leads to the equation 


Thus 


Since the left-hand side depends only on r, and the right-hand side only on 6, 
both sides must be constant. Denote the common constant value by 4. 
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Then we have 
v’ +rAv = 0 (10.4.2) 


and 
2,01 


reu" +ru'—u=0. (10.4.3) 


If we demand that v be continuous and periodic, then we must have (because 
of equation (10.4.2)) that A > 0 and in fact that \ = n? for some nonnegative 
integer n (so that we end up with solutions v = sinn@ and v = cosn@). We 
have studied this situation in detail in Section 10.2. For n = 0 the only suitable 
solution is v = constant and for n > 0 the general solution (with \ = n?) is 


y = Acosné+ Bsinné. 
We set \ = n? in equation (10.4.3) and obtain! 
ru +ru’—n?u=0. 


We may solve this equation by guessing u(r) = r™. Plugging that guess into 
the differential equation, we obtain 


r™(m? —n*)=0. 
Now there are two cases: 


(i) Ifn=0 then we obtain the repeated root m = 0,0. Now we proceed 
by analogy with our study of second-order linear equations with 
constant coefficients, and hypothesize a second solution of the form 
u(r) = Inr. This works, so we obtain the general solution 


u(r) =A+ Blnr. 


(ii) Ifn > 0 then m = +n and the general solution is 


u(r) = Ar® + Bro”. 


We are most interested in solutions u that are continuous at the origin, 
so we take B = 0 in all cases. The resulting solutions are 


n=0, w =ao/2 a constant 
n=1, w =r(a; cos @ + b; sin @) 

= 2). w =r? (ag cos 20 + be sin 20) 
n=3, w= r (a3 cos 36 + bs sin 36) 


1This is Euler’s equidimensional equation. The change of variables r = e* transforms 
this equation to a linear equation with constant coefficients, and that can in turn be solved 
with our standard techniques. 
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Of course any finite sum of solutions of Laplace’s equation is also a solu- 
tion. The same is true for infinite sums. Thus we are led to consider 


1 le 
w= w(r,é) = 30 + ys r! (a; cos j6 + b; sin 76). 
j=0 


On a formal level, letting r — 17 in this last expression gives 


1 — en 
gat SoG cos j9 + b; sin 79) . 


jal 


We draw all these ideas together with the following physical rubric. Con- 
sider a thin aluminum disc of radius 1, and imagine applying a heat distri- 
bution to the boundary of that disc. In polar coordinates, this distribution is 
specified by a function f(0). We seek to understand the steady-state heat dis- 
tribution on the entire disc. So we seek a function w(r, 6), continuous on the 
closure of the disc, which agrees with f on the boundary and which represents 
the steady-state distribution of heat inside. Some physical analysis shows that 
such a function w is the solution of the boundary value problem 


Aw = 0 
wlan = ff. 
Here we use the notation 0D to denote the boundary of D. 
According to the calculations we performed prior to this last paragraph, 


a natural approach to this problem is to expand the given function f in its 
Fourier series: 


1 foe) 
f(9) = 57% le So (a; cos j@ + b; sin 78) 
j=l 
and then posit that the w we seek is 


1 ae 
w(r, 6) = 300 + SS 1? (a; cos 70 + b; sin j@) . 
j= 


This process is known as solving the Dirichlet problem on the disc with bound- 
ary data f. 


EXAMPLE 10.4.1 Follow the paradigm just sketched to solve the Dirichlet 
problem on the disc with f(#) = 1 on the top half of the boundary and 
f(@) = —1 on the bottom half of the boundary. 


Solution: 
The data function f is odd on the interval [—7,7z]. It is straightforward to 
calculate that the Fourier series (sine series) expansion for this f is 


4 sin 30 sin 50 
6) = —[ sin Boss 
f() = (sin fags i oa ) 
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The solution of the Dirichlet problem is therefore 


4 3 sin 30 ° sin 50 
u(r6) = 2(rsina +" eh oe te), 
7 


10.4.1 The Poisson Integral 


We have presented a formal procedure with series for solving the Dirichlet 
problem. But in fact it is possible to produce a closed formula (i.e., an integral 
formula) for this solution. We now make the construction explicit. 

Referring back to our Fourier series expansion for f, and the resulting 
expansion for the solution of the Dirichlet problem, we recall that 


1/7 : 1 [” ie 
Somers f(y)cosjpdp and b; = ‘= f(y) sin jp de. 
Thus 
Cie mee 1 Tere 
w(r,8) = 540 a As 


1 
+= f(e)sin jp dpsin 70) : 


This, in turn, equals 


1 1 [* 
x00 + — > ie fg)r’ (cos ip 0s, +sin jesin 7) dy 
= 


1 ef 


= =ay+ - fier (costo 2 ete) 


We finally simplify our expression to 


u(r.) == f f(¢) 5+ dor cos j(0- 9) dy. 


j=1 


It behooves us, therefore, to calculate the expression inside the large paren- 
theses. For simplicity, we let a = 0 — y and then we let 


z=re'® =r(cosa+isina). 


Likewise 
zr =rre' = r"™(cosna +isinna). 
In what follows, if z= x+y, then we let Rez = x denote the real part of 
z and Im z = y denote the imaginary part of z. Also Z = x—iy is the conjugate 
of z. 
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Then 


= YS j Sie ae = j 
p72." cosja = Re ee z 


+z)(1-2) 
- Re ( a — =P ) 
= |2Z/ 
~ 2W1—2zP 
1—r? 


2(1 — 2rcosa+r?) ’ 


Putting the result of this calculation into our original formula for w we 
finally obtain the Poisson integral formula: 


1 


1-r? 


Observe what this formula does for us: It expresses the solution of the Dirichlet 
problem with boundary data f as an explicit integral of a universal expression 
(called a kernel) against that data function f. To be very plain about this, 
the kernel here is 


1 1—r? 


Pe Qn 1—2rcos(O—y) +r?" 


There is a great deal of information about w and its relation to f contained 
in formula (10.4.4). As just one simple instance, we note that when r is set 
equal to 0 then we obtain 


LOS 1 Foe: 


27 Ja 


This says that the value of the steady-state heat distribution at the origin is 
just the average value of f around the circular boundary. 
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Math Nugget 


Siméon Denis Poisson (1781-1840) was an eminent French 
mathematician and physicist. He succeeded Fourier in 1806 
as Professor at the Ecole Polytechnique. In physics, Pois- 
son’s equation describes the variation of the potential inside 
a continuous distribution of mass or in an electric charge. 
Poisson made important theoretical contributions to the 
study of elasticity, magnetism, heat, and capillary action. 
In pure mathematics, the Poisson summation formula is a 
major tool in analytic number theory, and the Poisson in- 
tegral pointed the way to many significant developments in 
Fourier analysis. In addition, Poisson worked extensively in 
probability theory. It was he who identified and named the 
“law of large numbers;” and the Poisson distribution—or 
“law of small numbers”—has fundamental applications in 
all parts of statistics and probability. 

According to Abel, Poisson was a short, plump man. 
His family tried to encourage him in many directions, from 
being a doctor to being a lawyer, this last on the theory 
that perhaps he was fit for nothing better. But at last he 
found his place as a scientist and produced over 300 works 
in a relatively short lifetime. “La vie, c’est le travail (Life is 
work),” said Poisson—and he had good reason to know. 


EXAMPLE 10.4.2 Consider an initial heat distribution on the boundary of the 
unit disc which is given by a “point mass.” That is to say, there is a “charge 
of heat” at the point (1,0) of total mass 1 and with value 0 elsewhere on OD. 
What will be the steady-state heat distribution on the entire disc? 


Solution: 

Think of the point mass as the limit of functions that take the value N on 
a tiny interval of length 1/N. Convince yourself that the Poisson integral of 
such a function tends to the Poisson kernel itself. So the steady-state heat dis- 
tribution in this case is given by the Poisson kernel. This shows, in particular, 
that the Poisson kernel is a harmonic function. | 
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a 


Exercises 

1. Solve the Dirichlet problem for the unit disc when the boundary function 
f(0) is defined by 
(a) f(0) =cos0/2, -17<O0<7 
(b) f(@)=0, -~t™7<O0<n 

_ 0 if -7<60<0 
(<) #0) = snd if O<0<7 
0 if -7<60<0 

(a) s@)={ § if O0<0<a 
(e) f(0)=07/4, —-17<O0<7 

2. Show that the Dirichlet problem for the disc {(a,y) : 27 + y? < R’}, 
where f(0) is the boundary function, has the solution 


w(r,0 = 30+ LG y( (a; cos 76 + b; sin j6) 


where a; and b; are the Fourier coefficients of f. Show also that the 
Poisson integral formula for this more general disc setting is 


r 


ee) =5 , Rh? —2Rr poo p)+r? Fle) de. 


[Hint: Do not solve this problem from first principles. Rather, do a 
change of variables to reduce this new problem to the already-understood 
situation on the unit disc.] 

3. Let w be a harmonic function in a planar region, and let C’ be any cir- 
cle entirely contained (along with its interior) in this region. Prove that 
the value of w at the center of C’ is the average of its values on the 


circumference. 
4. Ifw = F(a,y) = F(r,0), with « =rcos@ and y =rsin0, then show that 
Ow , Ow _ 1/9 (dw _ lew 
Or? ~~ Oy? or L Or \ Or r 06? 
_ Ow _low , 1 Ow 
Or? ' r Or | r? AG? 
[Hint: We can calculate that 
ou = oe cos 6 + a sin@ and oe = oe (sind) + Fe (0086) 
2 
Similarly, compute 2 (3) a nd a | 


5. Use your symbol manipulation software, such as Maple or Mathematica, 
to calculate the Poisson integral of the given function on [—7, 7]. 


(a) f(0) =1n?6 
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(b) f(0) = 6° - cosé 
(c) f(0) =e°-sind 
(d) f(@)=e*-né 


Historical Note 
Fourier 


Jean Baptiste Joseph Fourier (1768-1830) was a mathematical physicist of 
some note. He was an acolyte of Napoleon Bonaparte and accompanied the 
fiery leader to Egypt in 1798. On his return, Fourier became the prefect of 
the district of Isére in southeastern France; in that post he built the first real 
road from Grenoble to Turin. He also became the friend and mentor of the 
boy Champollion, who later was to be the first to decipher the Rosetta Stone. 

During these years he worked on the theory of the conduction of heat. 
Euler, Bernoulli, d’Alembert, and many others had studied the heat equation 
and made conjectures on the nature of its solutions. The most central issues 
hinged on the problem of whether an “arbitrary function” could be represented 
as a sum of sines and cosines. In those days, nobody was very sure what a 
function was and the notion of convergence of a series had not yet been defined, 
so the debate was largely metaphysical. 

Fourier actually came up with a formula for producing the coefficients of a 
cosine or sine series of any given function. He presented it in the first chapter of 
his book The Analytic Theory of Heat. Fourier’s ideas were controversial, and 
he had a difficult time getting the treatise published. In fact he only managed 
to do so when he became the Secretary of the French National Academy of 
Sciences and published the book himself. 

The series that Fourier studied, and in effect put on the map, are now 
named after him. The subject area has had a profound influence on mathe- 
matics as a whole. Riemann’s theory of the integral—the one that is used in 
most every calculus book—was developed specifically in order to study cer- 
tain questions of the convergence of Fourier series. Cantor’s theory of sets was 
cooked up primarily to address issues of sets of convergence for Fourier series. 
Many of the modern ideas in functional analysis—the uniform boundedness 
principle, for example—grew out of questions of the convergence of Fourier 
series. Dirichlet invented the modern rigorous notion of “function” as part 
of his study of Fourier series. As we have indicated in this chapter, Fourier 
analysis is a powerful tool in the study of partial differential equations (and 
ordinary differential equations as well). 

Fourier’s name has become universally known in modern analytical sci- 
ence. His ideas have been profound and influential. Harmonic analysis is the 
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modern generalization of Fourier analysis, and wavelets are the latest imple- 
mentation of these ideas. 


Historical Note 
Dirichlet 


Peter Gustav Lejeune Dirichlet (1805-1859) was a German mathematician 
who was deeply influenced by the works of the Parisians—Cauchy, Fourier, 
Legendre, and many others. He was strongly influenced by Gauss’s Disquisi- 
tiones Arithmeticae. This was quite a profound but impenetrable work, and 
Dirichlet was not satisfied until he had worked through the ideas himself in 
detail. He was not only the first to understand Gauss’s famous book, but also 
the first to explain it to others. 

In later life Dirichlet became a friend and disciple of Gauss, and also a 
friend and advisor to Riemann. In 1855, after lecturing in Berlin for many 
years, he succeeded Gauss in the professorship at Gottingen. 

In 1829 Dirichlet achieved two milestones. One is that he gave a rigorous 
definition of the convergence of series. The other is that he gave the definition, 
that we use today, of a function. In particular, he freed the idea of function 
from any dependence on formulas or laws or mathematical operations. He 
applied both these ideas to the study of the convergence of Fourier series, and 
gave the first rigorously proved convergence criterion. 

Between 1837 and 1839, Dirichlet developed some very remarkable appli- 
cations of mathematical analysis to number theory. In particular, he proved 
that there are infinitely many primes in any arithmetical progression of the 
form a+ bn with a and 6 relatively prime. His studies of absolutely convergent 
series also appeared in 1837. Dirichlet’s important convergence test for series 
was not published until after his death. 

Dirichlet also engaged in studies of mathematical physics. These led, in 
part, to the important Dirichlet principle in potential theory. This idea estab- 
lishes the existence of certain extremal harmonic functions. It was important 
historically, because it was the key to finally obtaining a rigorous proof of 
the Riemann mapping theorem. It is still used today in partial differential 
equations, the calculus of variations, differential geometry, and mathematical 
physics. 

Dirichlet is remembered today for the Dirichlet problem, for his results 
in number theory (the useful “pigeonhole principle” was originally called the 
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“Dirichletscher Schubfachschluss” or “Dirichlet’s drawer-shutting principle” ). 
He is one of the important mathematicians of the nineteenth century. 


TT 


Problems for Review and Discovery 


A. Drill Exercises 


1. Find the eigenvalues \, and eigenfunctions yn for the equation y” + Ay = 
O in each of the following cases. 


(a) y(-2 a y(2) =0 


(b) y(0) = y(3) = 0 
(c) y(1) = y(4) = 0 
(d) y(- 3) y(0) =0 


2. Solve the a string problem in Section 10.2 if the initial shape 
y(x,0) = f(x) is specified by the function f(x) = x + ||. Sketch the 
initial shape of the string on a set of axes. 

3. Solve the Dirichlet problem for the unit disc when the boundary function 
f(@) is defined by 
(a) f(0) =sin0/2, -7<0<7 
(b) f(0)=0+|0|, -r<O<n 
(c) f(@)=0@, -7<O0<a 

4. Find the solution to the Dirichlet problem on the unit disc with boundary 


data 

(a) (8) = |6| 

(b) g(0) =sin? @ 

(c) h(@) = cos0/2 

(d) f(@) =0/2 

5. Find a solution to this Dirichlet problem for a half-annulus: 


Ou 1du 1 Ou 

Or? or Orr? 06? 
u(r,0) = sinar, 1<r<2 
u(r,m) = 0,1<r<2 


u(1,0) = u(2,dé)=0,0<0<7n. 


= 0,1<r<2,0<0<7 
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B. Challenge Problems 


1. Is it possible for a harmonic function to vanish on an entire line segment? 
Give an example. 

2. Use methods introduced in this chapter to find a solution of the boundary 
value problem 


Ou Ou 
Ou Ou 
By (04) ag (Mt) =0, t>0 
u(x, 0) 3x4, 0<a<rT. 


3. Use methods introduced in this chapter to find a solution of the boundary 
value problem 


Ou Oru 
Ou 
u(0,t) = 0, u(l,é) + F(t) =0, t>0. 
u(z,0) = f(@),0<a<1 


4. Use methods introduced in this chapter to find a solution of the boundary 
value problem 


Ou ru 

Ot = ea a ae Oe ea ced 
u(0,t) = u(1,t)=1,t>0 
u(z,0) = 1,0<a<1. 


5. Use methods introduced in this chapter to find a solution of the boundary 


value problem 


Oru 
Ot 
u(0, t) 
u(x, 0) 
Ou 


Bp (9) 


ou 
Ox? 
u(1,t)=0,t>0 
1—cos’ re , O0<a<l 


O0<a2<1,t>0 


l—-sing,0<a<l. 


6. Use methods introduced in this chapter to find a solution of the boundary 


value problem 


Ou 
Ox? ’ 
u(2,t)=0,t>0 

u(2—-2),0<a4<2 


O0<x2<2,t>0 


cos4m7x,0<a4<2. 
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C. 


Problems for Discussion and Exploration 


1. 


Let w be a harmonic function in a planar region, and let D be any disc 
entirely contained in this region. Prove that the value of w at the center 
of D is the average of the values of w on D. 


Let w be a real-valued, twice continuously differentiable function on pla- 
nar region U. Suppose that both w and w? are harmonic. Prove that w 
must be constant. 

It is a fact (not obvious) that if u is harmonic on a connected region U in 
the plane and if (xo, yo) € U then u has a convergent power series about 
(xo, yo) of the form 


u(x,y) =) 9 aj,0(@ — x0)’ (y — yo)" 


jk 
for |(« — x0)” + (y — yo)| < €, some small € > 0. 


Use this information to show that if u vanishes on some disc in U then u 
is identically 0 on all of U. 

Use methods introduced in this chapter to find a solution of the boundary 
value problem 


Ou ui Ou 

we eS Be gy er Re kre 
Ou Ou 
7g Or 8) ppb yt)=0,0<y<1,t>0 
u(z,0,t) = u(#,1,t)=0,0<a<1,t>0 
u(z,y,0) = f(a,y),0<a<1,0<y<1. 


A vibrating circular membrane, or drum, of radius 1 with edges held fixed 
in the plane and with displacement u(r,t) (r is radius and ¢ is time) is 
given. That is to say, the displacement of any point of the drum depends 
only on the distance from the center of the drum and the elapsed time. 
This situation is described by the boundary value problem 


O7u (Ou 10u 
es SS — + —-_— Lt 
IE a(S nay »,O<r<l1,t>0 
u(1,t) = 0,t>0 
u(r, €) remains bounded as r > 0+ 
u(r,0) = f(r), 0<r<l1 
SH (7,0) = g(r),0<r<l. 


Here f is the initial displacement and g is the initial velocity. Use the 
method of separation of variables, as introduced in this chapter, to find 
a solution of this boundary value problem. [Hint: Bessel functions will 
be involved.] 
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Taylor & Francis Group 
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Table of Notation 


1.1 


F(a, y, dy/dz, d?y/dx?,...,d” f /dx”) 


dy/dx 


y' + a(x)y = (a) 


eS a(x) dx 
M(a,y)dx + N(x, y)dy =0 
Of /Ox = M, Of /dy=N 
OM /dy = ON/Oz 
g(tx, ty) = t° g(x,y) 


ay” + by’ + cy=d 


/ / 
ae aan 
Oey Ti U24o =f 


= [Pf W/u2 e- F7@) 4 de . 


Fs 
M 


mass 
force 

gravitational constant 
constant of 
proportionality 
differential equation 
(ODE) 

Leibniz notation 
first-order, linear 
equation 
integrating factor 
exact equation 
exactness condition 
exactness criterion 
homogeneity 
condition 
integrating factor 
increment of arc 
length 
electromotive force 
current 

resistance 
inductance 
capacitance 

charge 
second-order, linear 
ODE 


variation of 
parameters 


second solution in 
terms of first 
spring force 

mass 

Hooke’s constant 
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2.5 amplitude 
frequency 
period 
damping force 
external force 
position of planet 
force 
mass 
acceleration 
universal gravitational 
constant 


higher-order 

linear equation 

a particular solution 
general solution to 
the homogeneous 
equation 


Yg = Ary + Aaya +--- 
+An—1Yn—1 + AnYn : the general solution 


In : nth Bessel function 
a polynomial 
a power series 
a power series centered 
at a 
formula for power 
series coefficients 
n! . n factorial 
f(a) = ee, f(0)/j!- 23 + Ry(z) : Taylor expansion 
Rna(x) = ft) (€)/(n+ 1)! - 2741 remainder term for 
Taylor expansion 
Cm = jo aj bm—j : Cauchy product 
y= a yg aya? . Frobenius solution at a 
regular 
singular point 
f(m) = m(m — 1) + mpo + go = 0 : equation for m in 
Frobenius solution 
steady-state 
temperature 


orthogonal functions 
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d d 4.1 a Sturm-Liouville 
equation 
eigenvalues 

Ym : eigenfunctions 
xdy/dx) + (k?x — n?/x)y = 0 Bessel’s equation 
y” + (A + 16dcos2x)y = 0 : Mathieu’s equation 
Ut = C7 Une : the wave equation 
(1 — x)y” — 2ay’ + £(€+ 1)y =0 ; Legendre’s equation 
Chebyshev’s equation 
Hermite’s equation 
Lagrange’s equation 
state function in 
quantum mechanics 
Joule 
linear operator from 
quantum mechanics 
Be : total relative error 
4a + jar (a; cos ja + 6; sin jx) ; a Fourier series 
a Fourier coefficient 
a Fourier coefficient 
formula for ag 


formula for a; 


formula for 6; 

partial sum of Fourier 
series 

Cesaro means of a 
Fourier series 

odd extension of the 
function g 


even extension of the 
function g 
the length of an 
interval 
lju-v| < |jull - ||v|| : Cauchy-Schwarz 
-Buniakovski 
inequality 
Ju + v|| < ||ul] + |}v]| : triangle inequality 
0,1 : the space of 
continuous functions 
on [0,1] 
the Fourier transform 
of f 
a rotation 
the conjugate of f 
the reflection of f 
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Lf 7.1 the Laplace transform 
of f 
Laplace transform 
of the derivative 
Laplace transform of 
the second derivative 
the Laplace transform 
and translations 
derivative of the 
Laplace transform 


second derivative of 
the Laplace transform 


jth derivative of 
the Laplace transform 
the Laplace transform 
and the derivative 
the Laplace transform 
and the derivative 
L{xy"| = —(d/ds)[s?Y] + y(0) . the Laplace transform 
and the second derivative 
(x) = f f(a —t)g() de ; convolution 
(s) : Laplace transform of 
the convolution 
Ye : approximation to the 
impulse function 
erfe(x) = (2//7) fy edt : the erf function 
u state constant 
the space of Schwartz 
distributions 
seminorms on the 
Schwartz space 
a Schwartz distribution 
the distribution induced 
by the function f 
norm on a space of 
distributions 
the space Ce° 
the space of C™ functions 
a space of distributions 
a space of distributions 
~ = X0,1) : the scaling function 
w(x) = y(2x) — y(2a — 1) : the wavelet 


f* g(x) = 
Lf * g\(s) = L[f](s) - Lig] 


TABLE OF NOTATION 
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9.3 


V5 
Taf (a) = f(x — a) 
as f(x) = f(dx) 
j 
{2/2 a9; Tm} 
LD? = Diez W; 


{29/2a9; Tm} 


=Vo® Dio Wj 


an(m), bn(m), Cr 


ee 
mae 


pj,n(t) = 29/2yp(25t — k) 


Tx = [ar 


Wo 


p(2t) + p(2t _ 1) 
(2t) — y(2t — 1) 
Vew 


2 os 41,2n(t) + yj 41,2041(t) 


scale of function 
spaces 

the translation 
operator 

the dilation operator 
the orthogonal 
complement of 

Vj in Via 

an explicit 
orthonormal basis 
for W; 

orthonormal 
decomposition 

of L? 

an orthonormal basis 
for L? 

orthonormal 
decomposition 

for L? 

coefficients of the 
wavelet 

expansion 

a low-pass filter 

an unconditional 
basis 

an orthonormal basis 
for V; 

the support of w;,x 
the Haar wavelet 
space 


dilation equations 
direct sum of V and 
Ww 

a wavelet function 
the Haar wavelet 
space 

an orthonormal basis 
for W; 


basis vectors for 
dilation equation 


dilation equation 
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by e(t) = 2yj41,2%(t) — Ye j41,2041(0) 
95 = fis — Sj 
Vj41 = Vi BW; 
fj 
95 
Q;, Gj, Ay 


C(v) 


Ent(v) 
Am 


Ym 
Ou = 7° O2u 
A = 02/dx? + 8? /dy? 
82 /Or2 + (1/r)O/Or + (1/r2)02/00? 


dilation equation 
error term 
fundamental 
orthogonal 
decomposition 

an element of V; 
an element of W; 
the wavelet transform 
in matrix form 
median absolute 
deviation 
cumulative energy 
vector 

the entropy of v 
eigenvalues 
eigenfunctions 

the heat equation 
the Laplacian 

the Laplacian in 
polar coordinates 


the Poisson kernel 


Glossary 


Abel’s mechanics problem Given a nonnegative function T(y) that de- 
scribes how long it takes a bead to slide down a wire from height y, determine 
the shape of the curved wire that gives rise to the function T. 


additive identity An element 0 of the vector space V such that v-+0 =v 
for every vE V. 


additive inverse Given an element v of the vector space V, this is an ele- 
ment —v € V such that v + (—v) = 0. 


associated homogeneous equation Given the differential equation 


ay" + by’ + cy = f, 
the associated homogeneous equation is 


i 


ay” + by’ + cy =0. 


balls In a normed linear space, we have open balls 
B(x,r) ={t eX: ||t —x|] <r} 
and closed balls 


B(x,r)={teXx:||t—x] <r}. 


Bessel function A special function that arises in the solution of the Bessel 
differential equation 


zy” + xy’ + (a? — p*)y =0. 


Bessel’s equation The differential equation 


ay” + xy! + (2 —p*)y=0. 
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Bessel’s inequality 


Theorem: If {ug : a € A} is any orthonormal set in the Hilbert space H, 
and if Z(a) = (#,uq) for each a, then 


d= R@)?? < lel. 


acA 


boundary conditions If we are given a differential equation on an interval 
(a, b], then boundary conditions specify values for the solution at the points a 
and b. 


capacitance The force which stores electrical energy. 
catenary The curve that describes the shape of a hanging chain. 
Cauchy product A device for calculating the product of two power series. 


Cauchy-Schwarz-Buniakovski inequality This is the inequality 


I(v,w)| < |lvll- wll. 


Chebyshev’s equation This is the differential equation 


(1 —a)y" — ary’ +n¥ =0. 


C®™ Urysohn lemma 


Lemma: Let K and L be disjoint closed sets in R“. Then there is a C% 
function y on R% such that py =0 on K and y=1 on L. 


complex power series A power series with complex coefficients and a 
complex variable. 


constant perturbation method for linear, second-order equations A 
numerical method for solving differential equations in which one approximates 
the given equation by a constant-coefficient equation. 


convergence of a power series A power series converges at a point x if 


k 


lim ) a;x) 
k-+c0o0¢ 
j=0 
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converges. 


convex set A set F in a linear space X is convex if, whenever x,y € E and 
0<t<1, then 
(l-tha+tye Ek. 


convolution If f and g are integrable functions then their convolution is 
defined to be 


fate) = f fle-tglt)ae= f fog tat. 
critical point A point at which both da/dt and dy/dt equal 0. 


cumulative energy Let v € R” with v 4 0. Assume that y is the vector 
formed by taking the absolute value of each component of v and ordering the 
resulting nonnegative numbers from largest to smallest. Then we define the 
cumulative energy vector C(v) to be a vector in RN whose components are 
given by 


for j=1,2,...,N. 


d’Alembert’s solution to the wave equation This solution has the form 
1 
yle,t) = 5(F(t+ 2) + 9(t-2)). 
damped vibrations A simple harmonic motion with a damping force 
present. 
Daubechies wavelet A wavelet that is smooth and compactly supported. 


dialysis machine A machine designed by biomedical engineers to emulate 
the function of a kidney. 


differential equation An equation involving a function and some of its 
derivatives. 


dilation An action on Euclidean space induced by multiplying each variable 
by a fixed constant. Dilation by 6 is often denoted by ag. 
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dilation operators of Euclidean analysis These are the operators, for f 
a function on RY, given by 


asf(x) = f(de) and a f(a) = 5" f(a/6). 


Dirac delta mass This is the distribution that evaluates each test function 
at the origin. Intuitively this is a function that takes the value 0 at « 4 0 and 
takes the value +oo at the origin. 


Dirichlet kernel A kernel for summing Fourier series: 


sin (N + +) t 
sin st , 


Dy(t) —= 


Dirichlet problem The basic boundary value problem for the Laplacian. 
On the disc, for instance, we specify boundary values for a function u on the 
unit circle and ask that u be harmonic on the interior. 


Dirichlet’s theorem 


Theorem: Let f be a function on [—7,7] which is piecewise continuous. 
Assume that each piece of f is monotone. Then the Fourier series of f 
converges at each point of continuity c of f in [—-7,7a] to f(c). At other 
points x it converges to [f(a~) + f(at)|/2. 


discretization error This is 
En = Y(fn) — Yn- 


divergence of a power series A power series diverges at a point «x if it 
does not converge at x. 


eigenfunction If 7: X — X is a linear operator, then an eigenfunction or 
eigenvector for T is a vector v such that T’'v = Av for some scalar A. 


eigenvalue If 7: X — X is a linear operator, then an eigenvalue for T is a 
scalar A such that T’v = Av for some vector v. 


electromotive force The force that produces an electrical current. 


elementary transcendental function The familiar calculus functions sine, 
cosine, exponential, logarithm, their inverses and combinations. 


GLOSSARY 441 


entropy Let v = (v1, v2,...,vN) be an N-vector. Suppose that the v; assume 
k distinct values, 1 < k < N. Denote these distinct values by aj, a2,...,a% 
and let p(a;) be the relative frequency of a; in v. That is to say 0 < p(a;) <1 
is the number of times a; occurs in v divided by N. Then the entropy of v is 
defined to be the quantity 


k 


Ent(v) =} p(ae) - loga(1/p(ae)) 


l=1 


equation of biological growth The differential equation that describes a 
petri dish of bacteria or other life form that reproduces regularly. 


equation of exponential decay The differential equation that describes 
radioactive decay. 


essentially diagonal A linear operator T is essentially diagonal with re- 
spect to a wavelet basis if the matrix entries die off rapidly away from the 
diagonal. 


Euclidean 3-space The collection of all triples (x, y, z), where x,y,z € R. 


Euler method A numerical method for solving differential equations that 
is encapsulated in the formula 


Yeti =Yeth: f(re, ye) - 


Euler’s formula This is the formula 


e'’ =cosy+isiny. 


even function We say that f is even if f(—x) = f(a) for every wx. 


exact equation A differential equation that can be written in the form 
M(a,y)dx + N(a,y)dy = 0, with M = Of /0x and N = Of /Oy for some func- 
tion f. 


exactness condition ‘This is the condition 
OM ON 
Oy Ox” 


Fejér’s theorem 
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Theorem: Let f be a piecewise continuous function on [—7, 7]—meaning 
that the graph of f consists of finitely many continuous curves. Let s be the 
endpoint of one of those curves, and assume that lim,;_.,_ f(x) = f(s~) 
and lim,_,+ f(z) = f(st) exist. Then the Cesaro means of the Fourier 
series of f at s converges to [f(s~) + f(st)]|/2. 


first-order, linear equation A differential equation of the form y’+a(x)y = 
B(x). 


forced vibrations A simple harmonic motion with an external force acting 
on it. 
Fourier coefficients These are 
qT Tv 
aj=— f(a) cos ja dx and b= - f(x) sin jada. 
—t —T 


TT 


Fourier coefficients with respect to an orthonormal system Let H 
be a Hilbert space and {uq} an orthonormal system in H. If « € H, then the 
Fourier coefficients of x with respect to the orthonormal system are 


(a) = (2, Ua) - 


Fourier inversion The Fourier transform is univalent on the space of inte- 
grable functions. Its inverse, in Euclidean N-space, is given by 


f(y) = (@m)-% / Fleet aé. 


Fourier series of a function The expansion of a given function f into 
sines and cosines. 


Fourier-Stieltjes series The Fourier series of a measure. 


Fourier transform Let f be an integrable function on Euclidean space. 
Then the Fourier transform of f is 


fO= | fWe*é dt. 


RN 


Fourier transform of a Schwartz distribution Let be a Schwartz 
distribution and y a Schwartz function. Then 


Ay) = A). 
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Frobenius solution of a differential equation A differential equation 
solution of the form 


y = y(x) =2™- (ap + aya + aga* +---), 


where m may not be an integer. 


function defined by a power series On the interval of convergence, the 
power series 

co 

S- a; x 

j=0 
defines a function f. 


Gaussian white noise A signal composed of independent samples from a 
normal distribution with mean 0 and variance o?. 


Gauss- Weierstrass summation A technique for summing the inverse 
Fourier transform. 


general solution of a differential equation Usually a family of functions 
that describes all possible solutions to the differential equation. 


geometric interpretation of the solution This is usually the interpre- 
tation of the solution of a differential equation in terms of vector fields. 


Green’s function Let £ bea linear differential operator. A Green’s function 
is any solution of the equation 


LG(x,y) = d(y— 2). 


Haar basis The wavelet basis generated by step functions. 


hanging chain An interesting physical system is a chain, fixed at both 
ends, and hanging under its own weight. We can analyze such a system using 
a differential equation. 


heat equation This is the partial differential equation from physics that 
says 

Oy a) ary 

ot | Ox’ 
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Hermite equation This is the differential equation 


y” — 2axy’ + 2ay =0. 


Hermite polynomial A function defined by 


Heun’s method See improved Euler method. 


higher-order coupled harmonic oscillators A motion described by the 
equations 


ax 

me = —-kyryt+ k3(xo — x1) : 
dx 

ma = —koxe im k3(xe = x1) F 


These can be combined to yield a fourth-order differential equation. 


higher transcendental functions Transcendental functions defined by 
power series and that are not elementary. 


Hilbert transform The important linear operator given by 


pro fea. 


homogeneous equation ‘This could mean a differential equation whose 
right-hand side, or forcing term, is 0. It could also mean a differential equa- 
tion whose coefficients satisfy a certain homogeneity condition. 


homogeneous of degree a A function g is homogeneous of degree a if 
g(bx, by) = b° g(x,y). 


Hooke’s law _ The force exerted by a spring is proportional to its displace- 
ment. 
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imaginary number Any multiple of i. 


implicitly defined solution The solution of a differential equation that is 
expressed as an equation in x and y, but notin the form y = f(z). 


improved Euler method The numerical technique encapsulated by 


h 
WI = Vit 5° [f(xj,y5) + f(ej-41, 241), 


where 
Zj41 = Yj th- f(x5, 45) 
and j =0,1,2,.... 


impulse function Another name for the Dirac delta mass. Intuitively this 
is a function that equals 0 for « 4 0 and takes the value +00 at the origin. 


impulsive response The solution of a differential equation when the input 
is the Dirac delta mass. 


independent solutions These are usually solutions that are linearly inde- 
pendent. 


indicial equation The equation 
2m(m—1)+m-1=0 


for the index m in the Frobenius method. More generally, for the differential 
equation 
y" + py’ + qy =0, 


the indicial equation is 


m(m—1)+mpo + q- 


inductance The force which opposes any change in current. 


initial conditions Restrictions, usually imposed through values of the func- 
tion or its derivatives at a particular point, that restricts the choice of solution 
to a differential equation. Often the initial condition(s) are physically moti- 
vated. 
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inner product If V is a vector space, then an inner product on V is a 
mapping 
(e,e):VxVoR 


with these properties: 


(a) (v,v) 20; 
(b) (v,v) =0 if and only if v = 0; 


(c) (av + Bw,u) = a(v,u) + G(v,u) for any vectors u,v,w € V and 
scalars a, 3. 


inner product space A linear space equipped with an inner product. 
input function The stimulus of a system. 


integrating factor A factor which we can multiply through a differential 
equation to make it exact. 


interval of convergence The interval on which a power series converges. 


Kepler’s laws of planetary motion 


1. The orbit of each planet is an ellipse with the sun at one focus. 


2. The segment from the center of the sun to the center of an orbiting 
planet sweeps out area at a constant rate. 


3. The square of the period of revolution of a planet is proportional to 
the cube of the length of the major axis of its elliptical orbit, with 
the same constant of proportionality for any planet. 


Lagrange’s equation This is the differential equation 


cy” +(1—-2)y’ +ay=0. 


Laplace’s equation This is the differential equation 


Laplace transform This is the transform given by 


LAs) = Fle) = f° ee F(a) de. 
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Liénard’s equation This is the differential equation 


dx? dx 


qe tf@)y te) = 0. 


linear damping A spring for which the damping force is a linear function 
of dx/dt. 


linear operator A function T: V — W between vector spaces such that 
T(ax1 + C2X2) = c1T (x1) + c2T (X2) 


for any X1,X2 € V and any scalars cj, Co. 
linearization Approximation of a nonlinear system by a linear one. 
linear spring A spring for which the restoring force is a linear function of x. 


linear transformation A mapping of vector spaces T: V — W such that 
(a) T(v+w) =T(v) +T(w); 
(b) T(cv) = cT(v) for any scalar c. 


Maxwell’s field equations The law which forms the foundations of clas- 
sical electromagnetism, classical optics, and electric circuits. 


mean absolute deviation The mean absolute deviation or MAD is defined 
to be 


MAD(a) = median(|a1 — dmeal, |@2 — Gmeal,---;|@N — Gmeal) - 


Minkowski functional If F is a convex set in a linear space X, then the 
Minkowski functional jig of E is defined to be 


pp(x) =inf{t>0:t tx € E}. 


multi-resolution analysis A collection of subspaces {V;}jez of L?(R) is 
called a Multi-Resolution Analysis or MRA if the following are true: 


MRA, (Scaling) For each j, the function f € V; if and only if aaf € Vj41. 
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MRA, (Inclusion) For each j, V; C Vj41. 
MRAs3 (Density) The union of the V;’s is dense in L?: 


closure U Vip = T(R). 
jeZ 
MRA, (Maximality) The spaces V; have no non-trivial common inter- 


section: 
(| = {0}. 
JEZ 


MRA; (Basis) There is a function y such that {7;¢} jz is an orthonormal 
basis for Vo. 


Newton’s binomial formula A formula for (1 + «)?: 


(1+2)? =1+pr+ mpd) Wax a dat) dip 2) + 
ge eS ee ase: 
33 


Newton’s law of universal gravitation Newton’s inverse square law gov- 
erning the gravitational attraction between two planets. 


odd function We say that f is odd if f(—ax) = —f(«) for every f. 
Ohm’s law The law that governs electrical resistance. 


order of a differential equation The maximal order derivative that ap- 
pears in the equation. 


ordinary differential equations A differential equation that involves func- 
tions of one variable and ordinary derivatives. 


ordinary point A point at which the coefficients of a differential equation 
have convergent power series expansions. 


orthogonal complement If zx is an element of the Hilbert space H, then 
its orthogonal complement is 


z+ ={yeH: (z,y)} =0. 
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If E C H is a subspace, then its orthogonal complement is 
E+={yeH:(ly,e)=0 forall ec E}. 


orthogonal functions Two functions f, g are orthogonal if 


| fos(e) ae =0. 


orthogonality condition Two functions f, g are orthogonal if 


[ festa) ae =0. 


orthogonality condition with weight gq Two functions f, g are orthogonal 
with weight q if 


[ s2vateyate) ar = 0. 


orthogonal trajectory A curve that is pointwise orthogonal to a given 
family of curves. 


orthonormal system Let H be a Hilbert space. Then an orthonormal sys- 
tem in H is a collection of elements {u.} such that ||uq|| = 1 for each a and 
also (uq,Ug) = 0 whenever a F f. 


output function The response in a physical system. 


parallelogram law In an inner product space, 


lle + yll? + [2 — yl? = Qllzl]? + 2llyll?. 


partial differential equation A differential equation that involves func- 
tions of several variables and partial derivatives. 


particular solution This is a solution of the inhomogeneous equation 


ay” + by’ +cy=f. 


path of the system A path of the autonomous system is a function x = x(t), 
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y = y(t). 


Picard’s existence and uniqueness theorem A theorem that expresses 
the fact that most ordinary differential equations possess solutions. Also, un- 
der certain initial conditions, the solution is unique. 


Plancherel’s theorem If f is a square integrable function on R%, then 


(2n)-% / FOP dé = / f(a) ae. 


Poisson kernel The integration kernel that solves the Dirichlet problem. 


On the disc in the plane it is given by 
1 1-—r? 
P..(6) = ———_______- 
(9) 2n 1—2rcosé +r? 


potential theory The study of harmonic functions, subharmonic functions, 
and related ideas. 


power series _ A series of the form 


Co Co 
S- a,x) or Se a;(x—c)’. 
j=0 j=0 


pursuit curve Ifa dog is chasing a rabbit then its path describes a pursuit 
curve. 


quantum mechanics This is a fundamental theory in physics which de- 
scribes nature at the smallest scales of energy levels of atoms and subatomic 
particles. 


radius of convergence The radius of the interval on which a power series 
converges. 


Rayleigh’s problem The problem of studying the two-dimensional flow of 
a semi-infinite extent of viscous fluid, supported on a flat plate, caused by the 
sudden motion of the flat plate in its own plane. 


real analytic function A function that is locally representable by a con- 
vergent power series. 
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real power series A power series with real coefficients. 


recursion One or more equations that relate later indexed aj;s to earlier 
indexed ajs. 


reduction of order There are various algebraic and notational tricks for 
reducing the order of a differential equation from 2 to 1, or more generally 
from order k to order k—1. This usually facilitates the solving of the equation. 


regular singular point We say that a singular point xo for the differential 
equation 

yt+p-y +q-y=0 
is a regular singular point if 


(w—20)-p(z) and — (x ~ &)*q(x) 


are analytic at xp. 
resistance The force which opposes an electrical current. 
Riemann-Lebesgue lemma [If f is an integrable function, then 


lim |f(é)| =0. 


E00 


Riesz—Fisher theorem 
Theorem: Let {ua}aea be a complete orthonormal system in H. Let 
y € (7(A). Then y = @ for some z € H. 

Riesz representation theorem 


Theorem: If \ is a bounded linear functional on the Hilbert space H, 
then there is a unique element y € H such that 


Ar = (x,y) 


for all x € H. 


rotation An action on Euclidean space induced by a special orthogonal 
matrix. Rotations are often denoted by p. 


round-off error The error that results from rounding off decimals. 
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Runge-Kutta method A numerical method for solving differential equa- 
tions that can be described as follows: Let 


m, = h- f(x, yx) 
h 
mg = nf (m+ dm +) 
h 
m3 = bt (m+ 5m + 2) 
ma = h:- f(trth,yr+ms). 


Then yx+1 is given by 


1 
Yet = Yk + 5m + 2m2 + 2m3 + ma) ‘ 


scaling function The function y in the development of the Haar basis. 


Schwartz distribution <A continuous linear functional on the Schwartz 
space. 


Schwartz space The space of functions 


oF 


x 5,8 PC) < OO, 


— \? € C™(RY) : pa,a() = sup 


a= (Ay-+-,N)s8 = (Bir---s By). 


Schwarz inequality This is the inequality 


I(v, w)| < [lvl - [lw 


second-order, linear equation A differential equation of the form 


ay” + by’ +cy=d. 


separable equation An ordinary differential equation which can be written 
so that all the independent variables are on one side of the equation and all 
the dependent variables are on the other side of the equation. 


simple critical point An isolated critical point at which linearization is a 
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good approximation. 


singular Sturm-Liouville problem A Sturm-Liouville problem in which 
we allow p and q to vanish at the endsoints of the interval of study. 


solution of a differential equation A function, or family of functions, 
that satisfy the differential equation. 


state function In quantum mechanics, the function which represent the 
state of a point in space at time t. 


stationary fluid motion The solution of an autonomous system (which is 
independent of t). 


steady-state solution This is a solution u that does not change with time, 
so that Ou/Ot = 0. 


step function Also known as the heaviside function, this is the function 


Sturm-Liouville problem A differential equation of the form 


& (voy Z) + Drala) + rely =0. 


We typically assume that p and gq are non-vanishing. 


support of a distribution The complement of the union of all open sets 
U such that p(y) = 0 for all elements of C'S° that are supported in U. 


time-independent Schrédinger equation This is the equation 
—2 


h_—2 
Hoe at (V(r) -pwa=0. 


total relative error The quantity 
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tractrix An example of a pursuit curve. 


transcendental function A function that is not polynomial, rational, or a 
root. 


translation An action on Euclidean space induced by adding a fixed con- 
stant to each variable. Translation by a is often denoted Tq. 


translation-invariant operator A linear operator T on functions such 
that 
T(taf)() = (taT f)(2) . 


triangle inequality This is the inequality 


Ilv + wll < [lvl] + Ilwl. 


trigonometric series A series of the form 


1 co 
f(z) = 70 at Ni, @ cosnz + by, sin nc) . 
n=1 
unconditional basis A set of vectors {eo,e1,...} in a Banach space X is 


called an unconditional basis if it has the following properties: 


For each x € X there is a unique sequence of scalars ag, a1,... such that 
co 
t= ) Azje;, 
j=0 


in the sense that the partial sums Sy = 4 aje; converge to x in the 
topology of the Banach space. 


There exists a constant C' such that, for each integer m, for each sequence 
Qo, Q@1,... of coefficients as above, and for any sequence {o, 31,... satisfying 
|Gx| < |ax| for all 0 < k < m, we have 


m 
S> Beer 
k=0 


SG 


m 
s AkEk 
k=0 


undamped simple harmonic motion The motion of a mass attached to 
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a spring with no interference or damping force. 


undetermined coefficients A method of organized guessing used to solve 
differential equations. 
van der Pol equation This is the equation 


x 


d 
Sa tue? 1) +a =0. 


dt 


variation of parameters A solution method for differential equations that 
consists of applying algebraic techniques to the solutions of the associated 
homogeneous equation. 


wave equation This is the partial differential equation from physics that 
says 


wavelet The function w in the development of the Haar basis. 
wavelet basis The collection 

H= {2?/2a)7m :m,j € 2} 
is an orthonormal basis for L?, and will be called a wavelet basis for L?. 


wavelet transform The decomposition of a function f in terms of the 
spaces V;, W;. 


Wronskian determinant If y1, y2 are solutions of a differential equation, 
then their Wronskian is 


w= act ( za 
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